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1. Introduction 

One of the most remarkable known quantum field theories in four dimensions is the 
N = 4 supersymmetric Yang-Mills theory. This theory has the largest possible number of 
supersymmetries for a four-dimensional theory without gravity. It is believed to be exactly 
finite and conformally invariant. 

A long-standing conjecture asserts that this theory has a symmetry exchanging strong 
and weak coupling and exchanging electric and magnetic fields. This conjecture originated 
with work of Montonen and Olive, who [|J proposed a symmetry with the above properties 
and also exchanging the gauge group G with the dual group G (whose weight lattice is 
the dual of that of G). It was soon realized that this duality was more likely to hold 
supersymmetrically and in fact the N = 4 theory was seen to be the most likely 
candidate |3|] since only in that case the elementary electrons and monopoles have the 
same quantum numbers. (It has recently been argued that an analog of Montonen- Olive 
duality does hold for a certain N = 2 theory with matter hypermultiplets Q.) 

While Montonen- Olive duality was originally proposed as a Z2 symmetry involving 

the coupling constant only, the N = 4 theory has one more parameter that should be 

included, namely the 9 angle. As was originally recognized in lattice models |5]|| and 

string theory when the 9 angle is included, it is natural to combine it with the gauge 

coupling constant g in a complex parameter 

9 4ni ^ „. 

Z7T g z 

Then the Z2 originally proposed by Olive and Montonen can be extended to a full SL(2, Z) 

symmetry acting on r in the familiar fashion 

ar + b 
ct + cf 

here a, 6, c, and d are integers with ad — be = 1, so that the matrix 
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(1.3) 



has determinant 1. Indeed, SL(2, Z) is generated by the transformations 

) 1 
1 



S=[°, I) (1-4) 
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T=( I ]). (1.5) 



and 

T=( 

Invariance under T is the assertion that physics is periodic in 9 with period 27r, and S 
is equivalent at 9 = to the transformation g 2 /4n — > (gr 2 /47r) -1 originally proposed by 
Montonen and Olive. 

The difficulty in testing a conjecture that relates weak coupling to strong coupling is, 
of course, that it is difficult to know what happens for strong coupling. Until now, tests 
of this conjecture have involved quantities that have no quantum corrections and can be 
determined exactly at the semiclassical level; one then checks that the semiclassical results 
or formulas are invariant under SX(2,Z). For instance, the masses of stable particles 
that saturate the Bogomol'nyi-Prasad-Sommerfeld (BPS) bound are given exactly by the 
semiclassical result, as one can deduce from the structure of the super symmetry algebra. 
As explained by Sen in a recent survey |§, the SX(2, Z) symmetry predicts the existence of 
BPS-saturated multimonopole bound states. Sen verified this |1(| for the case of magnetic 
charge two in an elegant calculation that gave some of the most striking new evidence 
in many years for the strong-weak duality conjecture. The topological aspects of the 



generalization to arbitrary magnetic charge have been demonstrated by Segal [ I I 



Relation To String Theory 

The conjectured £X(2, Z) symmetry of the N = 4 theory gets further appeal from 
the proposal that this is actually a low energy manifestation of a similar symmetry in 



string theory. To be precise, the conjecture [12 involves the compactification of the het- 



erotic string theory on a six-torus, which gives a four-dimensional theory with N = 4 
supersymmetry; the expectation value of the dilaton multiplet determines the low energy 
parameters. This theory is conjectured to have SX(2, Z) symmetry acting on the dilaton 
multiplet. In this context, SL(2, Z) symmetry has been called S'-duality; it is strikingly 
similar to the usual R <-> 1/R symmetry of toroidal compactification of string theory known 
as T-duality. The analogy helped motivate the original speculations about S'-duality in 
string theory. T-duality has a conjectured generalization, mirror symmetry |13|,|14]], for 



which there is ample evidence [15-|T7[|. In this generalized sense T-duality relates compact- 



ification on one manifold to compactification on another manifold. It has been suggested 
that string-fivebrane duality []18| , |T9[1 could exchange S and T-duality, perhaps shedding 
light on the former; see again 0. 

The existing evidence for S-duality in string theory has been surveyed in 0. It 
largely concerns the existence of quantities (like the low energy effective action for the 
axion-dilaton system) that are unaffected by quantum corrections and whose SL(2, Z) 
symmetry can be verified at tree level. Also, there is the fascinating occurrence of duality 
of root systems both in the heterotic string and in the Montonen-Olive conjecture. 

The extension of SL(2, Z) from field theory to string theory greatly increases its 
potential significance. In field theory, this strong-weak duality would appear to be, at 
most, a curious phenomenon applying to very special field theories. In string theory, 
however, if this symmetry appears under toroidal compactification, then - as the radius 
of the torus is arbitrary - on thinking about the large volume limit it would appear that 
the SL(2, Z) must be a manifestation of some property of the uncompactified theory (just 
as the same is believed to be true for T-duality, which appears after compactification). 
And if S-duality comes from a property of the uncompactified theory, it must have some 
significance after any compactification. Thus, if valid in string theory, S'-duality should 
have some implications not just for special models but for the real world. 

In field theory, one has to be careful in calling SL(2, Z) a "symmetry"; it not only 
changes the coupling parameters but exchanges the gauge group G with the dual group 
G. Only a subgroup of SL(2, Z) maps G to itself. In toroidal compact ificat ions of string 
theory, SL(2, Z) is (if valid at all) a normal, albeit for the most part spontaneously broken, 
symmetry (in fact a gauge symmetry ||) with the unusual property of being valid only 
quantum mechanically. That last fact suggests that S'-duality may have a message that is 
now hard to perceive, especially since upon compactification to three dimensions, S and 
T duality are apparently combined into a bigger symmetry group [p0[ , perhaps giving a 
unification of % and a' - the parameters that control quantum mechanical and stringy 
corrections. 

Perhaps, as in the case of T-duality and mirror symmetry, examples will be found 



of string theories in which the strong coupling limit of one compactification is the weak 
coupling limit of another. A proper general statement may well be simply that dilaton 
moduli space is compact and smooth except for singularities that correspond to weak 
coupling limitsB; in particular, the same string theory might have several very different- 
looking weak coupling limits. 

Testing S -duality For Strong Coupling 

Despite the substantial evidence for S'-duality, it is frustrating that none of the existing 
tests of this strong coupling symmetry really involve the strong coupling behavior. The 
purpose of the present paper is to fill this gap by developing a true strong coupling test of 
S'-duality. 

At the same time, the test we carry out will involve computations in the vacuum of the 
A = 4 theory in which the non-abelian gauge symmetry is unbroken. Existing tests have 
generally involved computations in vacua in which the gauge symmetry is spontaneously 
broken to an abelian subgroup. 

To test S'-duality for strong coupling involves finding quantities that we can calculate 
for strong coupling. Most strong coupling calculations are out of reach, but exceptions 
are sometimes provided by quantities that can be interpreted as correlation functions in a 
topological field theory. A = 2 supersymmetric Yang-Mills theory has a twisted version 
that is a topological field theory |21]] and the same is true for A = 4 P2"| . The twisted 



theories coincide with the physical theory on a flat manifold or more generally on a hyper- 
Kahler manifold, but not in general. 

In this paper, we will consider just one of the twisted theories, and show that its 
partition function for gauge fields in a given topological class is the Euler characteristic 
of instanton moduli space. Thus, to simplify a bit, if is the Euler characteristic of the 
moduli space of /c-instantons, and q = exp(27rir), then the partition function is Z(q) = 

a,kq k We will test S'-duality by examining the modular properties of the function Z(q), 
on various manifolds, mainly for gauge groups SU{2) and SO(3). For computing Z(q) on 
various four-manifolds, we rely almost entirely on known mathematical results. For AT3, 



1 And perhaps also orbifold singularities where some discrete symmetries are unbroken. 



we rely on constructions by Mukai and others [p3|-p5| ; for CP we use formulas of Yoshioka 



and Klyachko |26| - |28|1 ; for the case of blowing up a point in a Kahler manifold we use a 



formula of Yoshioka [26|; and for ALE spaces we use formulas of Nakajima ||29|| . All of 



these formulas give functions with modular properties. 

The organization of this paper is as follows. In §2 we discuss the topologically twisted 
N = 4 Yang-Mills theory and explain why, in favorable conditions, its partition function is 
the Euler characteristic of instanton moduli space. (More generally, one needs to consider 
a more complicated equation that we describe in §2.) This section includes a self-contained 
review of the relevant aspects of topological theories. 

In §3 we formulate more precisely the predictions of S'-duality that we are going to 
test. We discuss some subtleties that may arise when discussing S'-duality on a compact 
four-manifold; they have the effect that the partition function can transform as a modular 
form rather than a modular function and that it can differ by an overall factor of q~ s from 
the generating function of instanton Euler characteristics. We then go on to sharpen the 
5-duality conjecture to allow for non-abelian electric and magnetic flux PD| . (While this 
paper was in gestation, we received a paper by Girardello, Giveon, Porrati, and Zaffaroni, 
who similarly incorporated the discrete flux on the four-torus in the context of S'-duality 
3lJ[.) We propose a transformation law for the partition function computed with fixed 



electric and magnetic fluxes. We also determine some general constraints on exponents 
and singularities. 

In §4 we begin testing the predictions. We first consider the case in which space-time 
is a K3 surface; this is a particularly nice case, as it is a hyper-Kahler manifold so the 
physical and twisted models coincide. Enough is known about instanton moduli spaces 
on K3 for sufficiently many instanton numbers to allow a strong test of the S'-duality. As 
most of the instanton moduli spaces on K3 can be explicitly exhibited as orbifolds, one 
can use techniques from orbifold constructions to compute its Euler characteristic. We not 
only find that the partition function of N = 4 super Yang-Mills theory on K3 is a modular 
object, as expected from S'-duality, but in the midst of this calculation we unexpectedly 
encounter the partition function of bosonic strings! 

We next consider CP 2 using the formulas of Yoshioka and Klyachko. We find that 



the topological partition function has interesting modular properties but does not quite 
converge well enough to be modular. A natural non-holomorphic modification makes it 



modular [p2|j3^| . Presumably, what is going on is that there is a holomorphic anomaly 



somewhat analogous to the one that arises ||34j| in certain two dimensional models. The 



two examples of K3 and CP 2 enable us to fix some important unknown constants that 
appeared in §3. 

Next we consider the case of the blow-up of a point on a four-manifold (i.e. gluing in a 
copy of CP 2 with opposite orientation) using the formula of Yoshioka |26| ; it turns out that 
under the blow-up the partition function essentially is multiplied by a character of the two- 
dimensional WZW model of SU(2) at level 1! We have no idea why two-dimensional field 
theory makes this appearance, but at any rate the function involved is certainly modular. 
(We propose a natural generalization of the blowing-up formula for groups of rank bigger 
than one, again involving WZW characters and satisfying some non-trivial checks.) 

Finally, following results of Nakajima, we consider the U(k) and SU(k) theories on 
ALE spaces, which are non-compact hyper-Kahler manifolds. Nakajima's formulas once 
again involve two-dimensional current algebra in a beautiful and unexpected way. His 
results appear not just to incorporate S'-duality but to go beyond what one would expect 
from field theoretic S'-duality; unfortunately we do not understand the predictions of S- 
duality on noncompact manifolds precisely enough to fully exploit them. 

In §5 we attempt to generalize these results using physical arguments. Imitating a 
strategy used recently in Donaldson theory p5 |, we restrict ourselves to Kahler manifolds 



with h 2 ' 7^ 0, and make a massive perturbation of the twisted N = 4 theory that preserves 
part of the topological symmetry. We propose a formula for the partition function of the 
N = 4 theory on these manifolds which beautifully obeys the various constraints. 

We conclude in §6 with some comments on the relation to string theory. On the one 
hand, we summarize the facts concerning the odd appearances in §4 of formulas from two- 
dimensional rational conformal field theory. And we pursue further the peculiar appearance 
of the (left-moving) bosonic string partition function in §4; we explain that a similar 
computation using the four-torus instead of K3 gives the (left-moving) oscillator states of 
a fermionic string. In a way these mysterious observations generalize an observation by 
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Gauntlett and Harvey for the string ground state |3(J. 

2. Twistings of Supersymmetric Yang-Mills Theory 

Before describing how topological field theories can be constructed by twisting of 
N = 4 super Yang-Mills theory, let us recall the situation for N = 2 ||21|| . N = 2 super 
Yang-Mills has a global symmetry group SU (2)j. The supercharges Q a i and Q dj - transform 
in the two-dimensional representation of this group. Working on a flat R 4 , the rotation 
group K = SO(4) is locally SU(2) L x SU(2) R . Under SU(2) L x SU(2) R x SU{2) h the 
supercharges transform as (2, 1, 2) © (1, 2, 2). 

Now, as long as we are on flat R 4 , we could find an alternative embedding of K 
in SU{2)l x SU(2)r x SU(2)i and declare this to be the rotation group. This can be 
done by leaving SU(2)l undisturbed and replacing SU(2)r by a diagonal combination of 
SU(2)r x SU(2)i; we will call this diagonal combination SU{2)' R . The modified rotation 
group is hence K' = SU(2)l x SU{2)'h. When one departs from flat R 4 , either by 
considering a curved metric or by working on a different four-manifold altogether, one 
uses not the usual stress tensor Tjj but a modified stress tensor T[- chosen so that the 
corresponding rotation operators are in fact K' . The "twisted" theory so defined therefore 
coincides with the physical theory only when the metric is flat. 

Under K', the supercharges transform as (2, 2) © (1, 3) © (1, 1). Let us call the (1, 1) 
element Q. Its claim to fame is that it obeys Q 2 — 0, and (roughly because it has spin zero 
in the sense of K'), under the twisted coupling to gravity, it is conserved on an arbitrary 
four-manifold M. Moreover, one finds that T'ij = {Q, A^ } for some A; this means that if 
Q is interpreted as a BRST-like operator, only Q-invariant observables being considered, 
then the coupling to the gravitational field is a BRST commutator and the theory is a 
topological field theory. 

The topological field theory constructed this way is quite interesting, being equivalent 
to Donaldson theory. 

Generalization To N = 4 
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Now we come to the generalization to N = 4. From the above, it is clear that the main 
point is to pick an homomorphism of K into the global symmetry group of the theory to get 
a twisted Lorentz group K' . N = 4 supersymmetric Yang-Mills theory in four dimensions 
has global symmetry group £77(4). The possible homomorphisms of K in 577(4) can 
be described by telling how the 4 of £77(4) transforms under K. Thus they correspond 
simply to four-dimensional representations of K. To get a topological field theory, we need 
a representation such that at least one component of the supercharge is a K' singlet. Up 
to an exchange of left and right, there are three four-dimensional representations with this 
property: (i) (2, 2); (ii) (1, 2) © (1, 2); (iii) (1, 2) © (1, 1) © (1, 1). Correspondingly, there 
are three topological field theories that one can consider. The theories corresponding to 
the last two of those three representations were discussed some years ago by Yamron |2!| . 
Our interest in the present paper will be in the theory determined by the representation 
(1,2) © (1,2). Note that this embedding of K in £77(4) commutes with a subgroup 
F = ££7(2) of 577(4) that transforms the two copies of (1,2). This becomes a global 
symmetry of the twisted theory. 

The supercharges, which under K x SU{4) transform as (2, 1, 4) © (1, 2, 4), transform 
under K' © F as (2, 2, 2) © (1, 3, 2) © (1, 1, 2). Thus there are two K' singlets, say Q and 
Q'. They obey Q 2 = (Q') 2 = {Q, Q'} = 0. They transform as a doublet of F. 

The gauge bosons of the N = 4 theory are, of course, scalars under £77(4). The left 
and right handed fermions transform under K x 577(4) like the Q's - so under K' © F, 
they transform as (2, 2, 2) © (1, 3, 2) © (1, 1, 2). If we ignore F, this is just two copies 
of the K' representation that appears in Donaldson theory. The scalars of the N = 4 
theory transform in the six dimensional representation of K; under K' x F they transform 
as (1, 1, 3) © (1, 3, 1). In other words, in the twisted theory these fields turn into three 
scalars (a triplet of F) and a self-dual antisymmetric tensor (a singlet of F). Of course, 
the fermions and scalars all take values in the Lie algebra of the gauge group. 

With the scalars denoted as v y , y = 1 . . . 6, the bosonic part of the Lagrangian of the 
N = 4 theory is 




(2.1) 
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plus a possible theta term 

^ J TrF A F. (2.2) 

After twisting, there is an important curvature coupling term that will emerge later. 

If the twisted theory is formulated on a Kahler manifold X , some special features arise. 
The holonomy of the Riemannian connection on a Kahler manifold is SU(2)l x U(1)r (for 
a generic oriented Riemanian four-manifold it is SU(2)l x SU(2)r). The twisting involves 
the embedding in the global symmetry group SU(4) of only U(1)r, not SU(2)r. The 
global symmetry group is the subgroup of SU(4) that commutes with the embedding of 
U(1)r; it is isomorphic to 577(2) x 577(2) x £7(1). Four supercharges instead of two are 
invariant under the twisted holonomy group SU (2)^x11 (1)'r, so the twisted N = 4 theory 
on a Kahler manifold has four fermionic symmetries instead of two. Indeed, one of these 
originates from each of the four underlying super symmetries of the N = 4 model. 

2.1. The Euler Class 

The goal in the rest of this section is to explain why the partition function of the 
twisted N = 4 theory that was just described is, under suitable conditions, the Euler 
characteristic of instanton moduli space. This will also help us understand the deviations 
from this formula that occur under certain conditions. Actually, the general background 
is not new [p7| - ^0| and the specific issues that lead to the Euler characteristic have also 
been discussed previously |3T] - |3"4|] , but we will attempt to develop the subject here in such 
a way as to make this paper as self-contained and readable as possible. To do so, we will 
first explain some very simple models, beginning with finite dimensional systems. 

To begin with, we consider a compact! oriented manifold M of dimensions d = 2n, 
endowed with a real oriented vector bundle V of rank d. We choose on V a metric g a b (we 
write (v, w) = g a bV a w b ) and an SO(d) connection A. We will consider a system with a 
topological symmetry Q (Q 2 = 0) that carries charge one with respect to a "ghost number" 
operator U. There will be two multiplets. The first consists of local coordinates u l on M 

2 Compactness is not necessary if the section s, introduced presently, has a suitable behavior 
at infinity A non-compact manifold with a suitable s will, eventually, be the main situation we 
study. 
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(of U = 0) together with fermions if) 1 tangent to M, of U = 1. The transformation laws 
are 

5u l = ieifj' 1 

(2.3) 

5tp l = 0. 

Here e is an anticommuting parameter. We also define 5q to be the variation with e 
removed, so for instance 5q4> 1 = . The second multiplet consists of an anticommuting 
section x a °f V °f U = — 1, and a commuting section H a of V; H has Z7" = 0. The 
transformation laws are 

5 X a = eH a - e5 u i A i a bX b 



5H a = eS^AftH 13 - ^Squ* l 8 u j bX l 



(2.4) 



Of course, one could here substitute for Squ 1 from ( |2.3|) ; we have written the formula in this 
way to indicate that it is a covariantized version of the more naive Sx a = eH a : 5H a = 0. 
The Lagrangian is to be L = SqW for a suitable W. Such a W is 

W=^\x,H + 2is) (2.5) 

with s an arbitrary c-number section of V and A a small positive real number. We get 

L = ±{H,H- 2is) + ±g ahX a ^-r - ±-F ljab ^^ X a X b - (2-6) 



Now we want to do the integral 

Z=\^-j J dudipdxdHe'^. (2.7) 

(The factors of 2n correspond to the standard factor of 1/ \/2tx for every bosonic variable 
in the Feynman path integral.) This integral is guaranteed to be a topological invariant - 
that is, to depend only on M and V - since the derivative of L with respect to any of the 
other data (A, g, A, and s) is of the form {Q, . . .}. We will call it the partition invariant 
of the system. 

As a first step to evaluate the integral, we integrate over H, getting 



\ \ 2 f /fssl 1 (9s & 1 

—J J dudijdx exp ( -ijJ- - -g ahX «—^ + — Fijab Wx a X° ) • C2-> 

10 



To proceed, we first consider the case s = 0. The integral is then done by expanding the 
four fermi interaction, giving 

f Pf (F A F A ... A F) 

with Pf (F A . . . A F) the Pfafhan on the a, b indices. The curvature integral in ( |2.9| ) is 
a standard integral representation for a topological invariant that is known as the Euler 
class of V (integrated over M); we will denote it as x{Y)- I* 1 case ^ = TM is the tangent 
bundle of M, x(V) coincides with the Euler characteristic x(M) of M. 

Now we consider the case of s 7^ 0, which is much closer to our general interests in 
this paper. The main idea is to consider the behavior for A — > 0. In this limit, the integral 
is dominated by contributions from infinitesimal neighborhoods of zeroes of s. For the 
first basic case, we suppose that s has only isolated and non-degenerate zeroes P a . In that 
case, near each zero one can choose local coordinates on M and a trivialization of s so that 
s a = f a u a (no sum over a here and in similar formulas below), with some real numbers f a . 
Higher order terms are irrelevant for small A. Then the contribution of a particular zero is 

^) 1 n / dua d ^ a d * a ex p {-^p- + \f^ a * a ) = n = ±l ( 2 - iq ) 



The answer is thus 



where 



Pa 



sign(detl^) ). (2.12) 



du 1 

Since the integral is independent of s, we learn by comparing to the result for s = that 
the Euler class of a bundle can be computed by counting the zeroes of a section, weighted 
with signs; this is a standard theorem. 

In our applications, we will actually need a hybrid of the two cases considered above. 
Suppose that s = on a union of submanifolds M a of M, of dimensions d a . We assume 



3 Another way to describe this is that e a measures the relative orientation of TM\p a and 
V|p a ; regarding ds\p a as an isomorphism between those spaces, t a = ±1 depending on whether 
the orientations of the two spaces agree or disagree under this isomorphism. 
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that the behavior of s in the normal directions to M a is non-degenerate; this means that 
locally one can pick coordinates u l , i = 1 . . . d — d a in the directions normal to M a and a 
trivialization of V such that 

s a = f a iu\ for i, a, = 1 . . . d — d a 

(2.13) 

s a = 0, for a > d — d a 

where f a i (with a, i = 1 . . .d — d a ) is an invertible (d — d a ) x (d — <i Q ) matrix. It is 
convenient to regard / as a d x d matrix whose other components are zero. Looking at the 



Lagrangian (|2.6|) , we see that this / is the "mass matrix" for the fermions near M a . The 
"massless components" of ift are those that are tangent to M a . The massless components 
of x are in the above trivialization the x a f° r a > d — d a . That trivialization is valid only 
locally; globally the massless components of x are sections of a vector bundle V a over M a . 

We will adopt the following terminology: we call the x a (which have ghost number 
— 1) "antighosts," and we refer to V a as the vector bundle of antighost zero modes. 

Now we will evaluate the integral for Z, in the limit of small A. The integral will be a 
sum of contributions from the various M a . These contributions can be evaluated as follows. 
Fixing a particular M a , the integral over the "massive modes," which roughly are those 
"normal" to M a , proceeds precisely as in the derivation of ( |2.10| ). One gets a Gaussian 
integral with bosons and fermions canceling up to sign, giving a factor of e a = ±1. @ Then 
one has the integral "tangent" to M a . The Lagrangian ( |2.6|) has the property that if one 
sets all of the "massive" fields to 0, one gets a Lagrangian of the same type, but with M 
replaced by M a , V by V a , and s by 0. The integral over the "massless" fields is thus an 
integral of the type that we have already seen in getting (|2.9| ), and so equals x(Ya)i the 



Euler class of V a (integrated over M a ). The final result is then 

Z = J2^x(V a ). (2.14) 

a 

Another way to obtain the same result is to perturb s to a nearby section 7 that 
has isolated zeroes. For instance, on each M a , pick a section s a of V a with only isolated 



4 In the spirit of the last footnote, one can describe e as a factor which, given orientations of 
TM and V, produces a relative orientation of TM a and V a . This is the data needed to fix the 
sign of x(^q) below. 
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zeroes. Regard s a as a section of V\M a ', extend it in an arbitrary fashion to a section of V 
that vanishes outside a small tubular neighborhood of M a (a neighborhood disjoint from 
Mp for (3 ^ a). Then s = s + e J2 a s a for sufficiently small e vanishes precisely on points 
on M a on which s a = 0. This gives a check on ( |2.14| ) in the following sense: evaluating 
( |2. 14j) by using the zeroes of s a to compute x(Va) one gets the same result that one gets 
by evaluating ( |2.11| ) for the section s. 

2.2. Counting Solutions Of An Equation 

It is illuminating to consider a very special case of this: counting the solutions of an 
equation. We will start with an elementary example. Consider a single real variable u and 
an equation 

u 2 -a = 0, (2.15) 

with real a. Obviously, the number of solutions is not a topological invariant; there are 
two for positive a, and none for negative a. 

To put this in the above format, take M to be the w-axis, V to be a one dimensional 
trivial bundle, and s to be the section of V 

s(u)=u 2 -a. (2.16) 

We want to compute the partition invariant of this system. According to ( |2.10|) , the result 
is 

Z = ^e Q , (2.17) 

a 

with a running over the zeroes of s, and e the sign of df /du at a given zero. For a < 0, 
Z = since there are no zeroes. For a > 0, there are two zeroes, at u = ±^/a. Z still 
vanishes since e = ±1 for u = ±yfa. 

The moral is, of course, that while the total number of solutions of an equation is not 
a topological invariant, the number of solutions weighted with signs (or in general, with 
multiplicities, if one encounters degenerate solutions) is such an invariant. 

Suppose, however, that we want to find an integral formula that counts without signs 
the total number of solutions of an equation. This cannot really be done, as is clear from 
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the above example. But there is a partial substitute which we will explain first in the 
above special case. Double the degrees of freedom, adding a new variable y and replacing 
the u axis by the u — y plane. Take V to be a two dimensional trivial bundle, and let s be 
the section of V given by the two functions 

2 2 

si = u — a — y 

(2.18) 

s 2 = 2uy. 

Now we consider the system of equations s = 0, that is si = s 2 = 0. The partition 
invariant Z is a topological invariant of this system which is the number of solutions of the 
equations weighted by sign. Let us compute this number - which of course can be defined 
as above by an integral formula - for various a. 

Suppose first that a > 0. The equations si = s 2 = have the two solutions u = ±y/a, 
y = 0. The result of including y is that e = 1 for both solutions. Indeed, the determinant 
in (|2.12[ ) is always positive (as dsi/du and ds 2 /dy have the same signs at each root of the 
equations). So Z = 2. 

What about a < 0? There are again two solutions, now at u = 0, y = ±y/—a. One 
can verify that again the contributions are +1 and so Z = 2. 

The reason that, for y = 0, e = +1 at each zero is that 

s 2 = y^ + 0(y 2 ). (2.19) 

This ensures that for zeroes with y = the determinant in ( |2.12| ) is positive. The conclusion 
is that not for all sections s, but for those sections that obey ( 2.19j ) and vanish only at y = 0, 



Z is equal to the total number of solutions of the "original equations" si(u, y = 0) = 0.@ 
The General Story 

The generalization of this is as follows. Suppose that one is interested in counting 
without signs the solutions of some equations 

s a (ii*) = 0, a= l...ra (2.20) 



5 In this statement and similar statements below, we assume the zeroes are nondegenerate; 
otherwise one must count the multiplicities. 
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in n variables u 1 ...u n . The u l are, in general, coordinates on some manifold M and, 
globally, the s a define a section of a vector bundle V. Introduce another set of variables 
y a , a = 1 . . .n, and extend s a to arbitrary functions s a (u % : yb) such that! 

s a (u\0) = s a (u l ) (2.21) 

Let hi(u J ,y a ) be any n additional functions such that 

' du 



hl (u\y b ) = J2ya^f- + 0(y 2 ). (2.22) 



Consider the system of equations 

s a = hi= 0. (2.23) 
Every solution of the original system (2.20) gives by taking y J = a solution of the 



extended system ( |2.23| ). Solutions of this kind have e = +1 (since dhi/dy a = ds a /du 1 and 
the determinant in (|2.12|) is positive). If these are the only solutions, then (|2.11|) reduces 



to 

Z = J2l = N, (2.24) 

with N the total number of solutions of the original equation. Thus if all solutions of 
the extended system are at y l = 0, then the number of solutions of the extended system, 
weighted by sign, is the same as the total number of solutions of the original system. Under 
this restriction, therefore, we do get an integral representation for the unweighted number 
of solutions of the original equation. 

Of course, the utility of all this depends on finding an interesting situation in which 
there is a suitable vanishing theorem. This paper will be based on such a situation. 

Generalization 

More generally, we will need the following variant of the above construction. Consider 
a system of dl equations 

s a = 0, a = l...d' (2.25) 



6 Geometrically, let M be the total space of the bundle V — > M, and V the pullback of V to 
M. The extended functions s a (y l ,yb) define a section of V —> M. Other formulas below have 
analogous interpretations. 
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for d variables u 1 . . .u d , with d' < d. If everything is sufficiently generic, the solutions will 
consist of disjoint, smooth, compact manifolds M a of dimension d—d' and nondegenerate in 
the sense of (|2.13| ). Introduce d! new variables yb, and extend the s a to functions s a {u l , yb). 
Introduce d additional functions hi such that hi = s ^ jh yb{ds h / du 1 ) + 0(y 2 ). Consider the 
system 

s a = hi = (2.26) 

of d + d' equations for d + d' unknowns u l , y a . 

The partition invariant Z is the number of solutions of this system weighted by signs. 
The solutions that have y = are simply the solutions of the original system s a = 0, and 
thus consist of the union of manifolds M a . Suppose that there is a vanishing theorem that 
ensures that all solutions of ( |2.26j ) are at y = 0. Then Z can be evaluated using ( |2.14j) , 
and is 

Z = J2x(V a ), (2.27) 

a 

where V a is the bundle of antighost zero modes along M a . The signs e a are all +1 because 
of cancellation between u and y. 

Moreover, in this situation, V a has a special interpretation. Since there are two sets 
of equations s a and hi, there are two sets of antighosts, say x a an d Xi- The assumption 
that the M a have the expected dimension d — d' and are nondegenerate means that the 
original antighosts \ a have no zero modes. As for the Xii they can be analyzed as follows: 
they are cotangent to the original target space M, and their zero modes are cotangent to 
the space M a of classical solutions. So V a is the cotangent bundle of M a , and the Euler 



class x(V a ) is the same as the Euler characteristic x(M a ). Hence we can rewrite ( |2.27|) as 

Z = ^ X (M Q ) = x(W), (2.28) 

a 

with W = U Q M a the space of solutions of the original equations F(u) = 0. So under the 
above-stated restrictions, the Euler characteristic of the space of solutions of a system of 
equations can be given an integral representation. 



2.3. Gauge Invariance 

The situation that we really want is a gauge invariant version of the above. So let 
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us explain how to incorporate a symmetry group in the finite dimensional model. First 
we consider the general construction that counts solutions weighted by signs and then the 
more special construction that (given a vanishing theorem) eliminates the signs. 

To incorporate a group action in these models, we assume that a compact Lie group 
G acts on M and V (preserving all the data such as the metric g on V and the section s). 
If G has dimension t, we take the dimension of M to be d = 2n + 1 and the rank of V to 
be d = In. Thus 

d-d-t=0 (2.29) 

We introduce a field <p, in the adjoint representation of G, with ghost number U = 2. 
We write <fi = Yl x =i 4> X T X with T x a basis of the Lie algebra of G. The action of T x on the 
manifold M is described by a vector field U x l , and the lifting of T x to act on the bundle 
V is described by the action on a section X '■ ^X a = U x l DiX a + Y x a bXb, with some Y. (Of 
course Dj is the covariant derivative with respect to the connection Ai on V.) 

For the fermionic symmetry, we take 

5(j> = 0. (2.30) 

The transformation laws of other fields are as follows. ( |2.3j ) is modified to 

5u l = ieif; 1 

(2-31) 

5^ = e ^u x l 

while ( |2.4]) is replaced by 

5 X a = tH a - e5 u i A i a bX b 

e ( 2 -32) 
8H a = e8 u i A i a b H b - -5 u* 5 u^ ' F l3 a bX b + ie^Y x a hX h . 

(This is just the gauge-covariant version of Sx = eH, SH = —e[(f), x], which is analogous to 
( |2.31| ) .) It is no longer the case that Q 2 = 0; rather, Q 2 is equal to a gauge transformation 
with generator (p x T x . (This structure gives a model of what mathematically is called 
equivariant cohomology; see [ p7| , |38[] .) 

To make it possible to write a Lagrangian, we introduce another multiplet (0, 77), in 
the adjoint representation of G, with ghost number U = (—2, —1), and a transformation 
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law analogous to the above: 

5(f) = ierj 



(2.33) 
5r] = ie[0,0]. 

We also pick a G-invariant metric gij on M. 

Now, set 

W = ^( X , i? + 2zs) + ^FgijUjp + W, (2.34) 

with A' a new small parameter and W consisting of possible non- minimal terms. Then 
define the Lagrangian 

1 1 <9s & 1 

L = S W =—(H, H - 2ts) + -g abX a —^ - — Fljab rrx a X b 

- ^X a X b <p x Y xab + ^rfg^Uj^ + ^4? g^UjUj <t? + (2.35) 

^ x (d k U xl - d t U xk )^ k r + 5 W. 

Notice that 5L = 5 2 W = 4> X T X (W) = 0, as W is gauge-invariant. 
Now we wish to study an integral 

1 



J (2.36) 



Vol(G) • (27v) d (-iy 

Notice that, while all other degrees of freedom are in bose-fermi pairs and so have a 
natural measure, <p is unpaired. To make sense of the integration measure in (|2.36|) , we 



pick a translation-invariant measure d<p on the Lie algebra of G. This determines a measure 
on the group manifold, and by Vol(G) we mean the volume of G with that measure. The 
choice of measure therefore cancels out of the ratio d(f)/Vol(G), so ( |2.36| ) has no unspecified 
or arbitrary normalization. It is convenient to use the chosen measure on the Lie algebra 
to define separately dcj) and drj (whose product, in any case, is naturally defined without 
any choices). Similarly, we take the Riemannian metric g^ on M to define the separate u 
and ifi measures - and measures on any subspaces of it's or i/j's. 



The standard BRST argument shows that the integral in (|2.36|) , if sufficiently well 
convergent, is a deformation invariant and depends only on the manifold M and bundle 
V, and the G action on them. There is one basic case in which this invariant is easy to 
determine. 
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That is the case in which G acts freely on M. When that happens, it is possible to 
reduce the gauge invariant problem on M to an ordinary problem, without gauge invari- 
ance, on the quotient M' = M/G. The bundle V and section s will be replaced by the 
objects V and s' over M' which pull back to V, s over M. The integral ( |2.36| ) will reduce 
to the one on M' that counts - with signs - the number of solutions of s' = on M'. 
Or equivalently, it counts with signs the number of solutions of s = on M, up to gauge 
equivalence. 

To justify these claims, note first that the statement that G acts freely on M implies 
that for any complex- valued <fi x 7^ 0, the vector field 6 = (j) x U x l has no zeroes anywhere 
on M and therefore the (f) kinetic energy in ( |2.35| ), which is |#| 2 , is strictly positive. Thus 
there are no or zero modes. (If G does not act freely at least locally, there are instead 
4> and (p zero modes, at some points on M.) 

Turning this around, the expression g xy = U x t U y J gij is for each G orbit in M a positive 
definite metric on the Lie algebra of G; it also determines a metric on the G manifold. This 
is simply the metric that comes from the fact that (i) G acts freely on M, so the orbits 
are copies of G; (ii) the orbits are embedded in M and get an induced metric from the 
Riemannian metric on M. The ratio of the induced measure to the one that was chosen in 



defining (|2.36| ) will be called ^det(g xy ). (That is really the definition of det(g xy ); as g, 



is a quadratic form rather than a matrix, its determinant only makes sense as a number if 
there is a pre-existing measure to compare to.) If we call the volume of G with the metric 
induced from the embedding Vol (G), then of course 



Vol'(G) = ^det(g xy ) ■ Vol(G). (2.37) 

Since the (p, (p kinetic energy is nondegenerate, one can perform the Gaussian integral 
over and (p. It gives a factor of 

(27rA')*det(^)- 1 . (2.38) 

Similarly, one can integrate over 77, which appears linearly in the Lagrangian. This gives a 
factor of 

^ %,,r/ r'). (2.39) 



A' 
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The delta function in ( 2.39 ) has the following interpretation. The vectors U x l generate 
the G action and so are tangent to the G orbits on M. The delta function in ( |2.39|) thus 
projects onto the components of ijj that are normal to the group orbits. The surviving 
components can be interpreted as giving a section of the pullback to M of the tangent 
bundle of M' = M/G. 

In fact, we can divide by the free action of G and reduce what is left of (|2.36| ) to an 
integral on M' . From integrating over the G orbits, we get a factor of Vol (G), which is 



given in ( |2.37| ). The delta function in ( |2.39| ) is «/ det g xy • 5(ifj) where ip are orthonormal 



components of ip tangent to the G orbits. With the Riemannian measure for -0, 

6$) = 1. (2.40) 



In this process of eliminating components of u tangent to the G orbits by dividing by 
G, and eliminating components of ijj tangent to the orbits by using the delta function, the 
factors of A' and det(g xy ) and extra factors of 2tt cancel out. What remains is a Q-invariant 
integral on M' of the standard type, with the standard measure. V and s "go along for 
the ride" in the above manipulations, and so are simply replaced on M' by the objects V 
and s' that pull back to V and s on M. 

So we can carry over all of our analysis of ( j2.7| ). When G acts freely on M, the 
invariant Z defined by the integral in (|2.36| ) simply counts, with signs, the solutions of 
s' = on M ; , or, equivalently, the gauge orbits of solutions of s = on M. 

Counting Solutions Without Signs 

If we want to find a way to count gauge orbits of solutions without signs, we must 
imitate the special construction that led to (|2.24|) and (|2.28|) . 



We recall that in that discussion, we started with fields u l , i = 1 . . .d and equations 
s a (u) = 0, a = l...d', with d' < d. Then we added dual variables y a , extended the 
s a to possibly depend on y, and added dual equations hi = 0. Among other things, 
these steps gave a system with equally many fields and equations. When the appropriate 
vanishing theorem holds, the partition function was the Euler characteristic of the space 
M of solutions of the original equations s a (u) = 0. 
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In the G-invariant case, that is not quite right, because (assuming G has dimension t 
and acts freely) by dividing by G one could remove t degrees of freedom from u, so that 
there are effectively only d — t fields to begin with. So to balance the fields and equations, 
one would need not d but d — t dual equations h. Moreover, instead of the /i's being dual 
to the tangent space of M, they should be dual to the pullback to M of the tangent space 
of M' = M/G. The latter condition will ensure that - after we descend to M' by dividing 
by G - we will arrive at the construction that we have already analyzed. 

A suitable set of d — t dual equations can be constructed as follows. Start with any 
G-invariant set of d functions hi such that 

h^y^ + Oiy 2 ). (2.41) 

Let 

L x = U x l h t . (2.42) 

Let IT be the projection operator (using the metric on M) onto the subset of /t's for which 
L = 0, and let 

h = U(h). (2.43) 

The desired set of d — t equations is h = 0. 

Suppose that a vanishing theorem ensures that the solutions of s = h = are all 
nondegenerate and have y = 0. The partition invariant for the system consisting of fields 
u l ,y a , equations s = h = 0, and symmetry group G can in that case be evaluated by 



dividing by G and using ( 2.28 ). It equals the Euler characteristic of W/G, where W is the 



space of solutions of the original equations F(u) = for u G M. 
Locality 

In field theory, however, the projection operator IT may be nonlocal and for that reason 
its use is best avoided. Instead of using this projection operator to reduce the number of 
equations, one can increase the number of fields, as follows. The assertion that n(/i) = 
is equivalent to the assertion that there exists an adjoint-valued function C x on M such 
that 

hj + C x U x l 9lJ = 0. (2.44) 
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Let us include the C x as additional fields of U = 0. Their ghost number one partners will 
be called ( x . The multiplet is the standard one 5C = ie(, 5( = ie[(f>,C]. Let k{ be any 
functions such that 

kj = hj + C x U x l 9lJ + 0(y 2 : Cy, C 2 ). (2.45) 

Consider the system of equations s a = hi = for fields u,y,C, with group action G. 
Suppose it is the case that there is a vanishing theorem ensuring that the solutions are 
all at y = C = 0. Then the partition invariant Z for this system, upon integrating out 
the (C, C) multiplet, reduces to the partition function for the system with fields u, y and 
equations s a = hi = 0. This reflects the fact that the equations k = with C present are 
equivalent to the equations h = with C absent. Under these conditions, we can again 
invoke ( |2.28|) and conclude 

Z = X {W/G), (2.46) 

with, again, W the space of solutions of the original equations. 

We should stress that the derivation of this formula has assumed that G acts freely on 
M and that W is a smooth nondegenerate compact manifold of the expected dimension. 
When these assumptions fail, the integral must be examined more closely. 



2.4- Gauge Theories In Four Dimensions 

At last we have assembled the needed tools, and we turn to our real interest - four 
dimensional gauge theories. 

We take X to be an oriented four-manifold with local coordinates x l , i = 1 . . .4. We 
pick a finite dimensional gauge group Go, and a Go bundle E with a connection A and 
curvature Fij = diAj — djAi + [Ai, Aj]. The curvature can be decomposed in self-dual and 
anti-self-dual pieces, 

Fi^ = \ [f i3 ± \e tjkl F k ^ . (2.47) 

We take G to be the group of all gauge transformations of this bundle, acting on A in the 
standard fashion, Di — ► h~ 1 Dih, where Di = di + Ai. 
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If Go is connected and simply-connected, the bundle E is determined topologically by 
a single integer called the instanton number. For Go = SU (N) this is 

1 



TtFAF, (2.48) 

M 



8tt 2 

with Tr the trace in the iV dimensional representation, k can be an arbitrary integer. If 
Go is not simply connected, k is still defined, but may not be integral; for instance, for 
G = SO(3), k G \Z. 

To apply the above constructions, we take M to be the space of connections on E. We 
take V to be the bundle of self-dual two forms with values in the adjoint-representation of 
E. A natural section of V is given by the + part of the curvature: 

s(A)=F+(A). (2.49) 

A zero of this section is called an instanton. The space W of instantons - solutions of 



s = - is infinite dimensional because of gauge equivalence. However ||45|1 , the moduli 
space of instantons M. = W/G is finite dimensional. Under assumptions analogous to the 
ones that we have made in discussing the finite dimensional examples, the dimension of 



the moduli space is [45 



dim(jM) = 4kh(G ) - dim(G )(l + b 2 + ). (2.50) 

Here b 2 + is the dimension of the space of self-dual harmonic two forms, and h(Go) is the 
dual Coxeter number of Go (equal to N for Go = SU(N)). 

In finite dimensions, under our usual assumptions, dim(.M) is equal to d — d — t, with 
d and t the dimensions of M and G and d the rank of V. In the gauge theory problem, d, 
d, and t are all infinite, but the difference d — d — t makes sense as the index of a certain 



elliptic operator that appears in the moduli problem | 45[j , and still equals dim(A^). 

If E is chosen so that dim(M) = 0, we are in the situation in which one can try to 
count, with signs, the number of points in M.. From our finite dimensional discussion, we 



7 For any compact simple Go, one defines k by a curvature integral analogous to (2.48), nor 



malized so that if Go is replaced by the universal cover Go of its identity component, then k is an 
arbitrary integer. 
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know exactly how to formulate an integral that will compute this quantity. We need a 
system with the following supermultiplets. 

At ghost numbers 0, 1 we have the gauge fields A{ and ghosts ipi. The ghosts have 
the gauge and Lorentz quantum numbers of the fields and so are a one-form with values 
in the adjoint representation. At ghost numbers —1,0, we have antighosts \ an d auxiliary 
fields H; they have the quantum numbers of the equations, and so are self-dual two-forms 
with values in the adjoint representation. At ghost number 2, we have a field with the 
quantum numbers of a generator of gauge transformations - that is, is a scalar field with 
values in the adjoint representation. At ghost numbers —2,-1 are the conjugate <fi of 
and its fermionic partner r\. 

The propagating bosonic fields are thus A, 0, - a gauge field and a complex scalar in 
the adjoint representation. This is precisely the bosonic part of the field content of A = 2 
super Yang-Mills theory. The fermionic fields ip, x, and r), transforming as (2,2), (1,3), 
and (1, 1) under SU (2) l x SU (2)r, similarly coincide with the fermions of the topologically 
twisted N = 2 theory, as described at the outset of this section. 

The bosonic part of the action can be read off from (|2.35|) . In doing so, we set 



X = X' = e 2 with e the gauge coupling. We also note that U y 3 (f) y is the change in the field 
under a gauge transformation generated by so in the gauge theory case is determined by 
the formula 5Ai = —Di<p. Finally, after eliminating the auxiliary field H, the bosonic part 



of the action is 



If we bear in mind that 



± I Tr(|F+| 2 + |A0| 2 ). (2.51) 



x 



( \F+\ 2 = ^f TrFijF ij + 7 / e^'Tri^Fw, (2.52) 

we see that - except for the last term, which is a topological invariant, a multiple of the 
instanton number k - the bosonic part of the action ( |2.51| ) is just the standard kinetic 
energy. From N = 2 super Yang-Mills theory, we are still missing a bosonic interaction 

^Tr[0,0] 2 . (2.53) 
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This will appear if for W in ( |2.34| ) we take 



W = -^Tr V [<j>,f\. (2.54) 



The whole topological Lagrangian (|2.35|) is indeed, in this situation, the twisted version 



of N = 2 super Yang-Mills theory that we recalled at the beginning of this section. The 
topological meaning of this theory is now clear, at least when the bundle E is such that 
the expected dimension of instanton moduli space is zero: this theory computes with signs 
the number of instantons, up to gauge transformation. 

For other E, the partition function of the theory vanishes because of ghost counting. 
However, for dim(A^) > 0, there are interesting BRST-invariant operators that can have 



non-trivial correlation functions. They are described in ||21|| . Their correlation functions 



are in fact the celebrated Donaldson invariants of four-manifolds. 

Euler Characteristic Of Moduli Space 

Closer to our interests in this paper is to find a way to eliminate the minus signs 
and compute, when dim(A / i) = 0, the total number of instanton solutions, up to gauge 
transformations. More generally, when dim(.M) > 0, we want to compute the Euler 
characteristic of instanton moduli space. 

From our finite dimensional discussion, we know how to do this as well: 

(1) We introduce conjugate fields with the quantum numbers of the equations. In 
the present problem, the conjugate fields are a self-dual two-form B + with values in the 
adjoint representation. They come with ghosts with the same gauge and Lorentz 
quantum numbers. 

(2) We introduce additional fields C with the quantum numbers of the gauge genera- 
tors. In the present problem, C is a scalar field with values in the adjoint representation 
of G. C comes with a ghost £ with similar quantum numbers. 

(3) We extend the original equations F + = to have possible dependence on the 
new fields. Success depends on whether the extension can be chosen so that, eventually, 
a suitable vanishing theorem will hold. In the present case, a suitable choice is to modify 
the section s to 

Sij = F + l3 + ~[C, + B+]g kl . (2.55) 
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The additions are needed to ultimately get a vanishing theorem - and compare to twisted 
N = 4 super Yang-Mills theory - as we will see. 

(4) Finally, one needs conjugate equations. Their general structure is given in ( |2.45|) ; 
up to first order in B and C, they are uniquely determined, but the higher order terms 
are arbitrary. In the present problem, the higher order terms are best set to zero. So the 
conjugate equations, found by interpreting (]2.45|) in the present situation, are k = with 



kj = D l B + l3 +D 3 C. (2.56) 

Associated with the conjugate equations, one adds new antighosts Xji an d new auxiliary 
fields. 

Let us examine the field content of the theory. The bosonic fields are the gauge field 
A, a self-dual two-form B, and three scalars C, 0, 4>. The fermionic fields are two self-dual 
two-forms, x an d ifr, two vectors if) and Xi and two scalars, rj and £. This is precisely the 
field content of a certain topologically twisted version of iV = 4 super Yang-Mills theory, 
as described at the outset of this section. 

Now let us work out the bosonic part of the action (|2.35|) . Apart from comparing to 



the A = 4 theory, this will enable us to see the conditions for a vanishing theorem. The 
crucial terms are the square of the section (|s| 2 + \ k\ 2 )/2e 2 . Integration by parts and use of 
the Jacobi identity with some slightly delicate cancellations leads to the following identity: 
I„|2 , 17.12 i r ( / i i \ 2 



+ I&I 2 1 



2e 2 2e 2 



v^Tr ((f+v + \{B lk , B fl ]gU + ±[C, B^ 

+ (Z^B lJ +AC) 2 ) 
^ J d A x^Tx LFV + \iDlBij) 2 + (AC) 2 + ^[B lkl B jk ][B ir , B jr ] 

+ \\ C i B ij\ 2 + \ B ij \ \{9ik9jl ~ 9u9jk)R + W + ijk ^j B h 



(2.57) 

with R the scalar curvature of X and W + the self-dual part of the Weyl tensor. 

The Vanishing Theorem 

Now let us look for a suitable vanishing theorem. Of course, if there is no vanishing 
theorem, the theory still exists. It is just harder to study, though conceivably richer. But 
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to study the theory, it is certainly important to understand whatever vanishing theorems 
do exist. 

According to our general discussion, the appropriate vanishing theorem would assert 
that the solutions of s = k = all have B = C = 0; then it follows that the topological 
partition function associated with the equations s = k = has for its partition function 
the Euler characteristic of instanton moduli space. 

The most obvious inference from (|2.57j ) is that if the metric is such that 



This is slightly less than we hoped for because we learn only that C is covariantly constant, 
not zero. However, the condition DiC = has the following significance: it means that 
C is covariantly constant and generates a gauge transformation that leaves the gauge 
field invariant, so that the gauge group G does not act freely on the space of solutions 
of s = k = O.i Instanton moduli space M. is singular at such points and our general 
assumptions fail there. Moreover, when Ai is singular, one would want to specify exactly 
what one means by the Euler characteristic. Our general formal arguments really need an 
extension (which we do not know how to give) when such singularities occur. 

At least informally, though, ( |2.59|) can be described by saying that when ( |2.58| ) is 
positive definite, the argument identifying the partition function with x(-M) is valid if one 
treats singularities of A4 properly. This is to be contrasted with the generic situation that 
would prevail in the absence of a vanishing theorem: then M. might be perfectly smooth, 
yet there might be solutions of s = k = with B, C ^ having nothing to do with 
instantons. 

8 A gauge field that admits a nonzero solution G of DiC = is called reducible. Such a gauge 
field can be interpreted as a connection with values in the subgroup G' of the gauge group Go 
that leaves C invariant. For instance, for Go = SU(2), G' will be the abelian group U(l). 




ijkl 



(2.58) 



for any non-zero B, then any solution of s = k = has 



B = DiC = F+ = 0. 
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The vanishing theorem that we have just stated, which is a nonlinear version of the 
45H , applies, for instance, to the four-sphere with its standard metric (for which 



W = and R > 0). However, there is a severe topological limitation on its applicability, 
namely 62 + (the dimension of the space of self-dual harmonic two forms) must vanish. To 
see this, note the following identity for a (neutral) self-dual two-form w: 

J (D l Wij) 2 = \ J {{DiWjk) 2 + Wij (^(gik9ji - gu9jk)R + W + i jk i^ w k i^j ■ (2.60) 
(This identity was part of the derivation of ( |2.57| ).) If w is harmonic, D l uiij = 0, so if 



the quadratic form in (|2.58|) is strictly positive, then w = 0. So if that quadratic form is 
positive, then 62 + = 0. 

Examples with 62 + = are very restricted and will not be useful in this paper because 
our computations will all involve gauge groups that are locally isomorphic to a product of 
SU(2ys. For SU{2) the dimension of instanton moduli space is 8k — 3(1 + &2 + ), and so is 
odd if 62 + = 0.@ The partition function is therefore zero for SU(2) if& 2 + = O.0 Manifolds 
with 5 2 + = might be of interest with other gauge groups such as SU(3). 

We will therefore need some further vanishing theorems, and we will discuss several 
variants. As a preliminary, let us consider the important case in which the quadratic form 
in ( |2.58|) is positive semi-definite. Then from (|2.60| ) we learn that a harmonic self-dual two- 
form w is covariantly constant. The existence of such a nonzero w reduces the holonomy 
group of X, and there are two possibilities: (1) If &2 + = F so there is essentially a single 
w, the holonomy group is reduced to U(2); then X is Kahler and w is the Kahler form. 
(2) If &2 + > 1, the holonomy is reduced still further. The only possibility is that X is 
hyper-Kahler and 62 + = 3. We will later discuss more precise vanishing theorems for such 
manifolds. 

The following simple fact will be useful. Though the right hand side of ( |2.57| ) is not 
manifestly positive (unless one is given some information about the curvature of X), the 



9 For SO(3), 4k is integral and the same statement holds. 

10 If instanton moduli space is an odd dimensional smooth compact manifold (without bound- 
ary), its Euler characteristic vanishes. Normally, those assumptions are too optimistic. However, 
the Euler characteristic appears in our formulas as a curvature integral - this is explicit in ( p.9|) 
- and so the Euler characteristic in the sense we want vanishes when M is of odd dimension. 
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equation itself shows that the right hand side is non-negative and vanishes when and only 
when s = k = 0. Now the right hand side of ( [2.571) is invariant under 



r:C^-C. (2.61) 

(This is part of the SU{2) global symmetry of the N = 4 super Yang-Mills theory men- 
tioned in the introduction.) Hence, given any solution of s = k = 0, we get a new solution 
by replacing C by —C. It follows that any solution of s = k = has 

D z C=[C,B] = 0, (2.62) 

without any assumption on the curvature of X. Thus, either C = or C generates a gauge 
transformation that acts trivially on both the gauge connection and B. 

If the gauge group is (locally) SU{2) (or a product of 577(2) 's) we can make a more 
precise statement since if C ^ 0, it breaks SU(2) to an abelian subgroup U(l). If C ^ 0, 
then [C, B] = implies B lies in the same £7(1) so [-By, B^i] = 0, and hence s = implies 
that F + = 0. But if &2 + > (which we may as well assume, if the gauge group is SU(2)), 
then for a generic metric on X, there are no abelian instantons.0 So for such gauge groups, 
we can assume that C = 0. 



Comparison To N = 4 

Before resuming the discussion of vanishing theorems, it will be helpful to make a 
more precise comparison of the topological theory that we have been considering so far to 
N = 4 super Yang-Mills theory. 

Apart from terms involving and 0, the bosonic part of the action in the topological 
theory is simply the right hand side of (J2.57[) . If we work on flat R 4 and set Boi = Bi , the 



11 The first Chern class of an abelian instanton is a two-dimensional cohomology class of X 
that (i) is integral, and (ii) is an eigenstate of the Hodge * operator with eigenvalue —1. (The 
minus sign is because we take the instanton equations to be that F + = 0, so that the non-zero 
part of F is F~ .) If &2 + > (which means that the subspace of H 2 (X) with eigenvalue +1 of * 
is non-empty) then for a generic metric on X, there are no non-zero cohomology classes obeying 
conditions (i) and (ii). 
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right hand side of (|2.57| ) is 

^ Tr f F V + (DiBj) 2 + (AC) 2 + *i] 2 + £[C, B 3 f \ . (2.63) 



This has an 0(4) symmetry rotating C, S^; it is the subgroup of the underlying 0(6) 
symmetry of the iV = 4 theory that does not act on </>, </>. In fact, (|2.63|) is precisely the 



bosonic part (|2.1| ) of the action of the N = 4 theory with two of the scalars - (j) and </> - 
set to zero, and a 9 term added. 

The bosonic terms involving <p can be found in a fashion similar to our discussion of 
the N = 2 theory. The <fi kinetic energy arises as in the discussion of ( |2.51| ). The other 
bosonic interactions, namely 

^J^x^gTr [[CMC A] + [B 4 ,^[B 4 ,0] - \[<P,W^) (2-64) 

originate as in ( |2.53| ) by adding to W some terms with the structure Tr t)t[T, <fi] where T 
(= 0, C, or B) is a bosonic field that transforms into r/T under the fermionic symmetry. 
In this way one gets a topological theory whose bosonic part on flat R 4 agrees precisely 
with N = 4 super Yang-Mills theory. The fermionic part of the action similarly coincides 
on flat R 4 with the N = 4 theory; it could hardly be otherwise given the supersymmetry. 

Vanishing Theorems On Kdhler Manifolds 

On a general four-manifold X, the topological theory differs from the N = 4 theory 
by the twisting that shifts the spins and by the curvature coupling on the right hand side 
of (|2.57| ). Generally, these couplings break the 0(4) symmetry that we noted in ( |2.63|) , 
leaving only the Z2 in ( |2.61| ) and a similar Z2 acting on B. There is an important case 
in which a larger subgroup survives. This is the case that X is a Kahler manifold. In 
that case, one can naturally decompose the self-dual two-form B into components of type 
(2, 0), (1, 1), and (0, 2). We will write the (1, 1) piece as bu, with ui the Kahler form and 
b a scalar field, and call the (2, 0) and (0, 2) pieces (3 and j3. 

Of the SO (4) symmetry of (|2.63|) on a flat manifold, an 0(2) that rotates b and C 
survives on a Kahler manifold. In fact, b and C are both scalars and have kinetic energy 
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of the same form. Also, the (1,1) piece of B is in the kernel of ( |2.58|) (this is more or less 
obvious from ( |2.60|) ) so the curvature term does not spoil the symmetry between b and C. 

Hence the arguments that we gave above for C carry over to 6, and for instance, on a 
Kahler manifold, with gauge group locally a product of SU (2)'s, we can assume that 6 = 
in a solution of s = k = 0, since we have proved that assertion for C. 

Now let us analyze the situation for the (2, 0) and (0, 2) part of B. For these com- 
ponents, Q2.58Q collapses to a positive multiple of R Tr j3/3, and so is positive if the scalar 
curvature R is positive. Thus, in that case [3 = 0. Even if R is zero rather than positive, 
( [2.571) implies that DiBjk = 0, so that if not zero B is covariantly constant. For gauge 
group locally a product of 577(2) 's, it follows (unless A is gauge equivalent to the trivial 
connection) that [B, B] = and hence (if s vanishes) F + = oS But for a generic Kahler 
metric, there are no abelian instantons (Kahler manifolds always have &2 + > since the 
Kahler form is self-dual), contradicting the fact that B ^ forces the connection to be 
abelian. 

So we conclude that, for a Kahler metric with R > and gauge group locally a 
product of S77(2)'s, the desired vanishing theorem holds and the partition function of the 
topological theory is the Euler characteristic of instanton moduli space. This important 
vanishing theorem applies to examples such as K3, CP 2 , and blowups of CP 2 at a small 
number of points. 

More General Kahler Manifolds 

What about more general Kahler manifolds? We still have b = C = 0. The curvature 
F+ can be usefully decomposed on a Kahler manifold in pieces F p,q of types (p, q) , with 
(f) o) = (2, 0), (1, 1), or (0, 2). The equations s = k = give first 

F 2,0 = F o,2 = 0? (2.6h) 
that is, the connection A endows the bundle E with a holomorphic structure; second 

Df3 = 0, (2.66) 

12 If A is trivial, the stated conclusions still hold since then F + = and s = implies that 
[B,B]=0. 
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that is, f3 is a holomorphic section of End(i?) <8> K (with K the canonical bundle of X), 
and finally 

u AF+ [p,p\ =0 (2.67) 

{uj is the Kahler form, of type (1,1); only the (1,1) part of F contributes in the equation. 
Interpreting j3 as (2,0) form, both terms in the equation are (2,2) forms.) Analogy with 
other somewhat similar problems (such as the "Higgs bundle" equations j46j) suggests that 
the last equation can be interpreted holomorphically as a kind of stability condition for 
the pair (E,j3). If so, a determination of contributions - if any - to the partition function 
from solutions with (3 ^ should be quite accessible. 

The following is a severe constraint. The above equations have the obvious U(l) 
symmetry 

P -> e iB [5 (2.68) 

(which is, again, a survivor of the SO (4) symmetry of (|2.63|) ). The contributions of solu- 
tions with j3 7^ to the topological partition function would equal the number of gauge 
orbits of such solutions, weighted by signs, if the number is finite. If there is instead a 
manifold W of such solutions, the contribution (according to equation ( |2.14| )) is ±x(V) 
with V the bundle of antighost zero modes. The Euler class x(V) of a £/(l)-equivariant 
bundle V can be computed by summing over fixed points of the U(l) action. Thus, the 
only solutions with /3 ^ that really have to be considered are those that are invariant 
under ( |2.68| ), up to a gauge transformation. 

That is only possible if the gauge connection is reducible. For gauge group SU (2), for 
instance, the only fixed points with f3 ^ are abelian configurations, with a connection of 
the form 

A-ft I) (2.69) 



0= I ° n ) • (2-70) 



and P of the form 

*=( 

x * 

Thus, the bundle E is E = L © L _1 with L a holomorphic line bundle; P is a holomorphic 
section of K <g> L~ 2 . Equation (|2.67| ) reduces in this situation to 

ujAF = PAP (2.71) 
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with now F the curvature of the connection on L. Since the right hand side is positive, 
this equation requires that [u] ■ c\(L) > 0, where [u] is the Kahler class and c\(L) the 
first Chern class of L. Conversely, if L is a line bundle with [u] ■ ci(L) > and f3 is a 
holomorphic section of K ® L -2 , then a standard convexity argument shows formally that 



there is a unique metric on L giving a solution of (|2.71| ). 

We can now uncover a qualitative consequence of the failure of the vanishing theorem 
on some Kahler manifolds. Of course, when the vanishing theorem holds, solutions of the 
equations are instantons and necessarily have 02(E) > 0. But connections of the form 
( F69D have 

c 2 (E) = - Cl (L) 2 , (2.72) 

and this can be negative. For instance, on a minimal surface of general type, we can 
obey the conditions if we take L = K 1 / 2 and (3 = 1. (In fact, in that case one can pick 
a Kahler metric such that [u] = c\(K) and solve ( |2.71|) very explicitly with F a positive 
multiple of a;.) This solution has instanton number 

c 2 (E) = -\c x (Kf. (2.73) 

Conversely, on a minimal surface of general type, if we pick a Kahler metric with 
[u] = ci(L), the instanton number of a solution of (|2.71|) is bounded below by (|2.73|) . 
The Hodge index theorem states that the intersection pairing on H > (X) is of "Lorentz 

signature" (H ...—). On a minimal surface of general type, c\(K) 2 > and hence 

ci(L) = Xci(K) + a, where A is a real number, a ■ C\(K) = 0, a ■ a < 0. The fact 
that K (gi L~ 2 has a nonzero holomorphic section f3 implies that A < 1/2; the fact that 
[u>] ■ c\(L) > implies A > 0. These conditions together imply ci(L) 2 < c\(K) 2 /4. This 
will be useful in 55. 



13 The condition roughly means that the line bundle K is very positive. "Most" two dimensional 
compact complex manifolds are of this type or obtained from such manifolds by blowing up points. 

14 That is, L is a line bundle such that L L = K . Such a line bundle only exists globally if X 
is a spin manifold, so only if X is spin does the bundle E = L © L _1 exist and contribute for the 
SU(2) theory. The corresponding SO (3) bundle ad(E) = L 2 © ® L~ 2 = K ®0 @ K' 1 (here O 
is a trivial line bundle) always exists and contributes to the SO(3) theory. 

15 The argument was explained to us by D. Morrison. 



33 



One more small extension of the vanishing theorems will be helpful in §4. K3 with 
a generic complex structure has no non-trivial holomorphic line bundles. Let X be such 
a generic K3 surface with one point blown up. On X there is an exceptional divisor D 
produced by the blow up; the canonical bundle is K = O(D), with c\(K) 2 = — 1. Any line 
bundle on X is of the form K® n for some integer n. Any Kahler form wonl has 

[u] ■ Cl {K) = u ■ [D] > 0; (2.74) 

the right hand side is just the area of D in the Kahler metric. Let us now show that ( |2.71| ) 



has no non-trivial solutions on X . The line bundle L would have to be of the form K® n as 
those are the only line bundles. For K £g> L~ 2 to have a non-zero holomorphic section, one 
needs n < 0. But in view of (|2.74j ), [u] ■ c\(L) > requires n > 0. So we get a vanishing 
theorem on X: one can compute via instantons. Essentially the same argument holds for 
K3 with any number of points blown up. 

2.5. Singularities Of Instanton Moduli Space 

In at least one respect, the above discussion is misleading. We have constantly assumed 
that the moduli space M. of solutions of the original equations is compact and non-singular. 
For moduli spaces of instantons, those assumptions are unrealistic. 

Compactness fails because instantons can shrink to zero size. The Euler characteristic 
entered our problem as a kind of curvature integral, beginning with ( |2.9| ). Only for a com- 
pact manifold (or for connections of very special type) does such an integral reproduce the 
Euler characteristic as defined topologically. To facilitate computations, one would hope 
that the curvature integral can be interpreted as the Euler characteristic of some compact- 
ification of M.. At least for X a Kahler manifold, there is a natural compactification of M. 
by stable sheaves in algebraic geometry; we will very optimistically use this compactifica- 
tion, since our computations will be based on results borrowed from mathematicians who 
used it. We do not know how to justify this assumption. 

Physically, the only obvious place that a compactification of M. would come from is 
string theory. To the extent that M. can be interpreted as a space of classical solutions 
of string theory, the good ultraviolet behavior of string theory should lead to a natural 
compactification. 
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Instanton moduli space may also have singularities; these arise at points in M. where 
C or B has a zero mode. For a generic metric on X , A4 is smooth. Nevertheless, the zero 
modes and singularities that occur in a one-parameter family of metrics are important in 
verifying the formal arguments for topological invariance of the theory. 



2.6. Self- Con jug acy 

One point that may puzzle the reader is that the twisted N = 4 system has N = 2 
topological symmetry and an SU (2) global symmetry; but we have so far discussed it as an 
N = 1 topological system, and (in discussing the vanishing theorems) we exhibited only a 
small piece of the global symmetry. Here we will fill this gap in generality. It is convenient 
to do so in the general context of the whole class of models that we have discussed in this 
section. 

First we consider the general construction that counts solutions of a system of equa- 
tions weighted by signs. The fields at various ghost numbers follows: 

U = 2: (j) x 
U = 1 : ip l 

U = : u\ H a (2.75) 
U = -1 : X a , rf 
U = -2: X . 

Here (u\ ifi 1 ) are a multiplet of fields and ghosts; (x a , H a ) are a multiplet associated with 
the equations; and (f>, <fi, and rj are fields associated with the symmetry group G. 

Now we consider the more detailed construction that eliminates signs when a vanish- 
ing theorem holds. In this case, there are three sets of multiplets containing fields: the 
"original" multiplets (w\-i/> 1 ), the "dual" multiplets {yai^a), an d the multiplet (C X X X ) 
associated with the symmetries. In addition, there are auxiliary multiplets (x a , H a ) asso- 
ciated with the original equations and (xi,-ffj) associated with the dual equations. The 
other fields (p x , </> , rf are unchanged from the general picture in ( |2.75| ) . So we get this 
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setup: 

U = 2 : 4> x 

u = i-. r,^x x 

U = 0:u\y a ,C x ,H a ,H l (2.76) 

U = -l: X a ,X*,V X 
U = -2: X . 

Now, ( |2.76|) differs from the more generai structure ( |2.75| ) in being seif-conjugate in 
the following sense. The fields at ghost number U = — 1 have the same quantum numbers 
as the fields at ghost number 1 (if we bear in mind that there are metrics gij and g a b that 
can be used to raise and lower indices), and likewise the quantum numbers are the same 
for ghost number 2 and —2. In this self-conjugate case, instead of our usual BRST-like 
operator Q of U = 1, we obviously could define a similar operator Q' of U = — 1. It will 
soon be clear that we can take Q 2 = (Q') 2 = {Q, Q'} = 0, up to gauge transformation. 



One can actually ask for more. Define an SU(2) action on the fields in (|2.76|) such 
that 4>i C, <p make a three-dimensional representation, if), x and i/j, x make up two different 
two-dimensional representations, and the other fields are invariant. One can arrange so 
that the pair Q, Q' transform in a two-dimensional representation of this SU(2). 

To implement this, even in a superfield language, is really quite easy. Introduce a 
doublet of anticommuting variables 6 , A = 1,2, transforming in a two-dimensional repre- 
sentation of SU (2). Arrange 0, C, 4> into an SU (2) triplet 4>ab = 4>ba- The supersymmetry 
transformations are to be 

QA = i^j-e B [<p AB , •], (2.77) 

with [4>AB-, ' ] denoting the infinitesimal G transformation generated by the Lie algebra 
element 4>ab- Obviously, {Qa, Qb} = up to a gauge transformation. 
Form superfields 

T* = u* + iQ A ^ A l + ltAB0A ° B H\ (2.78) 
with if) 1 and x l being the components of i^a 1 -, and 

Y a = y a + i9 A ^ A a + %6AB l ° B H a , (2.79) 
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with ip a and x a being the components of ipA a - The transformation laws given earlier for 
the fields in those multiplets can be summarized by 

ST* = -ie A {Q A ,T i } 

(2.80) 

5Y a _ ST i A a bY b = -ie A {Q A: Y b }. 

If we combine rj, (, as an SU(2) doublet r/A, then the transformation laws for the fields 4>ab 
and rjA associated with the gauge symmetry are 

S(pAB = i^AVB + esVA 

1 (2-81) 
$VB = ~-e A [<PbcA CA } ■ 

Unfortunately, we do not know of a general description of the possible QA-invariant 
Lagrangians. In the case of gauge theory, of course, the standard N = 4 Lagrangian is 
one. 



3. Predictions of Strong- Weak duality 

In this section we formulate precisely the predictions of strong weak duality that we 
are going to test. Consider N = 4 super Yang-Mills theory with gauge group on 
flat Euclidean four-dimensional space. Then the fact that the energy-momentum tensor 
T^ v is invariant under S-duality implies that if we consider the same theory on a curved 
background it should still respect S'-duality. The simplest object to compute is the parti- 
tion function of the theory. This will in general depend on the manifold, its metric g^, 
and the gauge coupling constant and 9 angle which we combine as r = J- + One 
would expect from S-duality that (up to some universal factors) the partition function 
Zj|f(T,f, <7 M „, G) should transform under r — > — 1/r to the same object for the dual group 
the transformation from G to G is part of the original Montonen- Olive conjecture. 

To test any prediction of S-duality, we need to be able to compute exact (or at least 
strong coupling) quantities in the theory, as S-duality relates weak to strong coupling. 

16 We are making a small change of notation relative to §2, where the finite dimensional gauge 
group was called Go, and the name G was reserved for the group of local gauge transformations, 
that is (roughly) the group of maps of space-time to Go- 

17 Recall that G is the group whose weight lattice is dual to that of G. 
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Unfortunately the partition function Zm in general is too difficult to compute.lll 



Here is 



where topological twisting becomes helpful. First of all, the twisted theory should still 
be S'-dual since twisting, as discussed in the last section, basically means introducing 
background fields which couple to the SU(4) global symmetry current of the theory, and 
those currents are S'-dual. The theory being topological means in particular that, barring 
anomalies,^! the partition function is a holomorphic function of r and is independent of the 
metric on M. Thus we would formally expect the partition function on M to depend only 
on r and the group chosen; we write it therefore as Zm(t, G). As discussed in the previous 
section, Zm is determined (when the appropriate vanishing theorem holds) by the Euler 
characteristics of instanton moduli spaces. Suppose for simplicity that G is connected and 
simply-connected. Then G-bundles on M are classified by a single integer, the instanton 
number. If Ai n is the moduli space of instantons of instanton number n, x denotes the 
Euler characteristic, and q = exp(27rzr), then the partition function would bel^l 



(#Z(G) is the number of elements of the center Z{G) of G; this factor is present because 
Z(G) acts trivially on the space of connections and one divides by it in performing the 
Feynman path integral.) Under the S transformation, r — ► — ~, how should Zm transform? 
The most naive guess is that simply 



in other words that Zm is strictly invariant under S-duality. 

18 The case that M is a hyper-Kahler manifold is an exception as we see later. 

19 As we will find later, such anomalies do occur in certain cases, as in Donaldson theory. 
However the discussion of modular properties of the partition function in the rest of this section 
still holds. 

20 If G is not connected and simply-connected, the classification of bundles is finer, as we discuss 
later, and the instanton number n may not be an integer. However, we still write the following 
formula, with the sum now running over all topological types of G-bundles. The discussion below 
is valid with obvious modifications. In any case, we systematically examine these issues later. 




(3.1) 



Z m (-1/t, G) = Z m (t, G), 
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However, two natural generalizations of this come to mind. One is the possibility that 
instead of being modular invariant, Zm might transform like a "modular form," 

/ T \w/2 ^ 

Z m (-1/t,G)=±( k -) Z m (t,G) (3.2) 

for some w (the factor of i in the denominator is there to guarantee S 2 = 1; this leaves 
room for an extra overall ± sign). One other familiar fact suggests a modification of S- 
duality in curved space: the leading power of q in a modular object is not always 0. For 
example the Dedekind ^-function has an integral expansion multiplied by . In string 
theory this comes from a shift in the zero point of the energy. Similarly here we might 
expect a shift in the zero point of the instanton number, as a result of which the formula 
for the partition function should be modified by an overall multiplicative factor to be 

^ = ^J&(W (3.3) 

for some s. 

To give at least some explanation of how such subtleties could arise, note that even if a 
Lagrangian L is S-dual in flat space and has an S-dual extension to curved space, we have 
to ask precisely what extension of L to curved space is S-dual. For instance, as we saw 
in §2, the BRST symmetry of the twisted theory requires the presence of some curvature 
couplings that one might not have guessed. Even if all (/-number terms are known in the 
extension of L to curved space (in the twisted theory they are all determined by BRST 
invariance, modulo BRST commutators), one can still add c-number terms. Thus, if L\ is 
one extension of the theory to curved space, another is 

L'i = L 1+ [ d^g (e(r) + f(r)R + f(r)R 2 + ...), (3.4) 

J M 

where the terms are local operators constructed from the metric, and we have to ask 
whether it is Li or L'\ that is S'-dual. We have here made an expansion in local operators 
because we assume that the statement "the theory is S-dual in curved space" means that 
there is a local Lagrangian which is S'-dual in curved space. 

Taking Li to be the twisted N = A theory as formulated in §2, topological invariance 
means that the c-number terms must themselves be topological invariants. The only 
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topological invariants of a four-manifold that can be written as the integral of a local 
operator are the Euler characteristic x an d the signature a. Thus, the unknown c-number 
terms must be of the form e(r)x + f(r)a, with e and / being unknown functions of r. If 
this is so, then Zm defined as in ( |3.3| ) would fail to be SL(2, Z)-invariant by a universal 
X and (T-dependent factor, and with suitable e and /, this could lead to the subtleties 
suggested above. We now however have the additional information that we should expect 
the modular weight w/2 and the instanton shift s to be linear functions of x an d cr: 

w = ax + ba 

(3.5) 

s = ax + ftcr, 

with universal constants a, 6, a, and ft. For the physical N = 4 theory, the coefficients b 
and ft would have to be zero as a is odd under parity (which is a symmetry of the physical 
model when 9 = 0). For the twisted theory, parity is violated explicitly and it is not clear 
that b and ft should vanish. However, it will turn out that they do. 

A further subtlety arises if one does not require that s be integral. Then, although 
( ft.l|) is strictly invariant under 

T=(J J). (3.6) 

which corresponds to — ► 9 + 2n or t — > r + 1, ( |3.3| ) would change under r by an 
overall phase. One could interpret this as a kind of global gravitational anomaly in the 2tt 
periodicity in 9 - or perhaps better, as a clash between ^-duality and the 2n periodicity. 
If this possibility is realized, then Zm is an object somewhat like the Dedekind r\ function 
- transforming under SL(2, Z) almost like a modular form of some particular weight, but 
with some additional phase factors. We will see that all of these possibilities are realized; 
the zero of instanton number is shifted by a multiple of x, which is not always integral so 
one gets the phases just mentioned, and there is also a modular weight that is a multiple 
of X- 

It is convenient sometimes to get rid of the modular weight. This can be done by 
multiplying Zm by r\~ w to get 

Zm = V w Zm 
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Under S-duality we have 

We denote the shift from integer power q expansion in Zm by q~ c / 24 . This definition is 
motivated by a similar appearance of the central charge c in two-dimensional conformal 
field theory. In principle c depends on both M and G. Note that c is related to s by 

c = 24s + w = a'x + b'a (3.7) 

where a', b' can be written in terms of a, 6, a, (3. 

3.1. Fractional Instanton Numbers 

The original Montonen- Olive conjecture states that under S : r — > — 1/r, the gauge 
group G is replaced by the dual group G. With the exception of G = Eg, it is impossible 
for both G and G to be simply-connected, and therefore we are all but forced to discuss 
the phenomena that arise for non-simply-connected groups. 

Before being general, let us discuss the important special case that G = SU{2); then 
the dual group is G = SU(2)/Z 2 = SO(S). Of course, tti(50(3)) = Z 2 . 

SU(2) or SO(3) bundles on the four-sphere are both classified by a single integer, the 
instanton number. That is not so on a more general four-manifold. The basic difference 
between SU(2) and SO(3) bundles arises first in two dimensions. On the two-sphere S 2 , 
bundles with any gauge group G are classified by 7r 1 (G), so SU(2) bundles are trivial, but 
there are two types of SO(3) bundle, labeled by Z 2 . The non-trivial SO(3) bundle can 
be described explicitly as follows. Let L be the basic U(l) magnetic monopole bundle of 
magnetic charge one. Then the SU(2) bundle F = L 1 / 2 ©L -1 / 2 does not exist, since L 1 / 2 
- which would be a line bundle of magnetic charge 1/2 - does not exist. However, the 
SO (3) bundle that would be derived from F (for instance, by taking the symmetric part 
of F ® F to construct spin one from spin 1/2) is 

E = L®0®L~ 1 (3.8) 

(O is a trivial bundle), and does exist. The fact that E exists but there is no associated 
SU{2) bundle makes it clear that E is non-trivial. 
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Now, if E is an SO (3) bundle on a four-manifold M, then for every two sphere S in 
M, we can assign an element a(S) G Z 2 that measures whether the restriction of E to S 



is trivial or not. a(S) is what 't Hooft |30| called the non-abelian magnetic flux through 
S. One justification for the terminology is that if S is the boundary of a three-manifold, 
then automatically a(S) = 0, as one would expect for magnetic flux. Mathematically, 
the association S — > a(S) (or a slight elaboration of it if M is not simply-connected), is 
an element of H 2 (M, Z 2 ) which is called the second Stieffel- Whitney class of E, 102(E). 
The SO (3) bundles that are associated with SU{2) bundles are precisely those for which 
W2 = 0. 

For instance, if E is of the form ( |3.8| ) for some line bundle L, then a(S) is the mod 
two reduction of (S, c\(L)) (the latter is the pairing of S with the first Chern class ci(L); it 
vanishes if S is a boundary). Therefore, W2(E) in this situation is the mod two reduction 
of ci(L), a fact that will be used in §5. 

Since tci(SO(3)) = Z 2 and 7Ts(SO(3)) = Z are the only non-trivial homotopy groups 
of SO (3) in dimensions low enough to matter, an SO (3) bundle Bona four-manifold M is 
classified by two invariants, which are v = w^iE) and the instanton number k. However, 
these are correlated in a perhaps surprising fashion; k is not an integer in general, but 
rather 

k = - V —^- mod 1. (3.9) 

Here v ■ v is defined as follows: one lifts v to an integral cohomology class v' and then 
interprets v ■ v as the usual cup product v' ■ v'; it is evident that v ■ v/4 is independent of 
the lifting modulo 1. 

At least for simply connected M, ( |3.9| ) can be proved as follows. After lifting v as 
above, pick a line bundle L with a(L) = v' and let E' = L © O © L" 1 . Thus w 2 (E') = 
W2(E). The SU(2) bundle associated with E' would be L 1 / 2 © L -1 / 2 and its instanton 
number would be — ci(L) 2 /4 = —v-v/4. HI Thus we have proved (|3.9|) for a bundle with the 



21 The fact that this SU(2) bundle may not really exist is immaterial; the instanton number is 
defined by a curvature integral which can be taken either in the spin one-half representation or - 
with an extra factor of 1/4 - in the spin one representation. Writing down the formal expression 
for E' simply lets us use a short-cut: for an SU(2) bundle N iV -1 with iV a line bundle, the 
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same W2 as E. However, any two SO (3) bundles E and E' on M with the same W2 become 
isomorphic if a point p is deleted from M (since then the classification of bundles involves 
the homotopy groups 7Tk(SO(3)) for k < 2, and 7Ti is the only non-trivial one). They differ 
therefore by a topological twist that is localized near p, but the localized topological twists 
are the ones that can be defined on the four-sphere, and shift the instanton number by an 
integer. 

3.2. The Group And The Dual Group 

( |3.9|) implies that the instanton numbers in the SO(3) theory, for which we sum over 
all bundles with arbitrary W2, take values in Z/4 but not in general in Z. Therefore, while 
the SU(2) theory on a four-manifold is invariant under r — > r + 1, the SO(3) theory is in 
general only invariant under r — > r + 4. There is one situation in which this is improved 
slightly: if M is a spin manifold, then v -v is even for all v; and hence the SO (3) instanton 
numbers take values in Z/2, so the SO (3) theory is invariant under r — > r + 2. 

Now we can understand better the precise implications of S'-duality. The modular 
transformation T : r — ► r + 1 maps the SU(2) theory to itself (perhaps with an anomalous 
phase, as discussed above). The transformation S : r — > — 1/r, according to Montonen 
and Olive, maps SU(2) to SO(3). A tranformation by T 4 then maps the SO (3) theory 
to itself, and subsequent transformation by S will map us back to SU{2). So the SU{2) 
theory will be mapped to itself by the operation ST A S. The SU (2) theory should therefore 
be transformed to itself by the subgroup of SL(2, Z) generated by the matrices 



These matrices generate the subgroup of SL(2, Z) consisting of matrices whose lower left 
entry is congruent to modulo 4. This subgroup is known as To (4). In the case of a spin 
manifold, one would get the subgroup of SL(2, Z) generated by S and ST 2 S; this is the 
group Tq (2) of matrices whose lower left entry is congruent to modulo 2. 

instanton number is —ci(N) 2 . 




(3.10) 
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Therefore, the prediction of 5-duality is that the 577(2) (or SO (3)) partition function 
Zm is modular for r (4), or r (2) for spin manifolds. Zm is not quite a modular form 
for these groups, but transforms with extra phases, because we have allowed a shift in 
the instanton number in multiplying by q~ s and moreover because ( |3.2| ) is not quite the 
conventional definition of the transformation of a modular form. We can get rid of most 
of the phases by studying the function 

f M = V~ W+C Z = rfZ (3.11) 

We see that Jm has weight c/2 and is invariant under r — > r + 1. It is still not quite a 
standard modular form unless c is an integer, but this will turn out to be the case in our 
examples. If c is an odd integer, we get what is called a modular form of half-integral 
weight. HI 

Let us note some aspects of the fundamental domains of each of these two groups. 
These fundamental domains parametrize the inequivalent values of r if 5-duality is valid. 
The fundamental domain of To (2) has two cusps and a Z2 orbifold point.il One cusp 
(say at r = 00) corresponds to the instanton expansion for SU (2) and one (say at r = 0) 
corresponds to the instanton expansion for SO (3). The Z2 orbifold point has 9 = n from 
the SU(2) viewpoint. The fundamental domain of To (4) is a two-fold cover of that of To (2); 
this eliminates the orbifold point and creates a third cusp. While again two of the cusps 
correspond to the instanton expansions of the SU(2) and SO (3) theories, the meaning of 
the third cusp is more mysterious. We will make a proposal later: the expansion near 
the third cusp is the instanton expansion of the SO (3) theory restricted to bundles E for 
which 102(E) = W2(M). Here W2 (M) is the second Stieffel- Whitney class of the tangent 
bundle of M and vanishes precisely if M is a spin manifold; when that occurs two of the 
three cusps of r (4) are equivalent and we get back to the r (2) picture. 



22 These are objects that transform as 6 C under To (4), where 9 is the usual theta function. 
To check that fu has the appropriate q expansion near the third cusp of To (4) one needs the 
sharpened version discussed later of the S'-duality conjecture, /m may still fail to be a modular 
form in the usual sense if it has poles at some cusps, which may occur - as we discussed in 
connection with ( |2.71| )- if bundles of negative instanton number contribute due to a failure of the 
vanishing theorem. 

23 See [47|, p. 231, solutions to exercises 12 and 14, for this and subsequent assertions. 
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Generalization To SU (N) 

For SU(N), the story is very similar. The dual group is now SU(N) /Zjv- As 
tti(SU(N)/Zn) = Zat, an SU(N)/Zn bundle has in addition to the instanton number 
an additional invariant v taking values in H 2 (M, Zjv). One can think of this as a Zjv- 
valued magnetic flux. The instanton number h v for a bundle characterized by a magnetic 
flux v is not necessarily an integer but obeys 

V ■ V V ■ V , , 

K = 2N " ~T~ m ° d 1 (3 - 12) 

Note that this is invariant under v — > v + Nv' and so is independent of the choice of lifting 
of v to an integral class. 

To justify ( p.l2p , we follow the same steps as in the SO (3) case; it is enough to prove 
the result for some bundle with given v. Picking a line bundle L with ci(L) = v mod N, 
we let F be the "SU(N) bundle" F = @^ =1 L a \ where (for instance) a; = l/N for i < N 
and a at = — (N — 1)/N. Though F may not exist as an SU(N) bundle, the corresponding 
SU (N) /Tin bundle E' does. Its "magnetic flux" is v. The instanton number of E' (which 
can be computed as if F exists) is 




(3.13) 



and this gives ( |3.12| ). 

The vector a = (ai, . . . ajv) used above, with Oj G Z^v, — aj G Z, and J2i a i = 0' 
can be interpreted as a vector in the weight lattice T w of SU(N). T w contains the root 
lattice r as a sublattice of index iV with T w /F = Z^r. If we replace a by an arbitrary 
vector in T w , the flux v of the bundle E would be [a] ■ ci(L), and the instanton number 
would be — ci(L) 2 a 2 /2 modulo 1 (with [a] the coset of a in T w /T = Zat, and a 2 the length 
squared of a). This is similar to conformal field theory, where SU(N) characters at level 
one are theta functions of the V cosets in r„, and for the character derived from a given 
coset, Lq = a 2 /2 modulo 1. The analogy would be perfect for a one dimensional H 2 (M, Z) 
generated by e with e 2 = —1 (see the discussion below). 

Just as in conformal field theory, this way of writing things leads at once to the 
generalization for arbitrary simply laced groups. The magnetic flux v takes values in 
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H 2 (M, F w /r), and the instanton numbers are equal modulo 1 to —v ■ v/2 where the v ■ v 
is computed using the tensor product of the natural inner product on H 2 (M, Z) and the 
inner product on the weight lattice T w . 



3.3. Sharpening the S -duality Conjecture 

Let us consider the theory with SU(N)/Zn gauge group. As we have discussed, an 
SU(N) /Zn bundle on M has a "magnetic" invariant v G H 2 (M, Zjv). Let 62 be the second 
Betti number of M. If for simplicity we suppose that M is simply-connected (or at least 
that there is no iV-torsion in if 2 (M, Z)), then the magnetic flux takes distinct possible 
values. In the SU(N) /Zjv theory we sum over all these possibilities. We will consider the 
partition function of the theory with a fixed magnetic flux vector v, that is summing over 
all values of the instanton number with a fixed v. Let us denote this partition function 
by Z v where we hide the M dependence to avoid too many labels. It is natural to ask 
how the individual Z v J s transform under SL(2, Z). It will also be natural to study how 
Z v = rj~ w Z v transforms. The T-transformation is particularly easy. From ( |3.12|) and from 
the fact that we have shifted by an overall q~ s = q~^ + ^k we have 

Z v (t + 1) = exp(27u(-s + h v )) Z v (r) 

Z v (t + 1) = exp(27rz(-^ + h v )) Z v (r) (3.14) 

where h v is given by ( |3.12| ). More subtle is of course the S transformation. 

It has been shown by 't Hooft [30] that in the Hamiltonian formulation, analogous to 
the Zjv-valued magnetic flux, it is natural to introduce also Z^-valued electrical fluxes. 
Moreover he showed that they are Fourier transforms of one another. In our setting this 
means that the path integral with electrical flux w is 

%T = const. }^exp f — —\ Z v 

where again the v-w is the inner product on H 2 (M). It is natural to extend the conjecture 
of strong/weak duality to include not only the exchange of ordinary electric and magnetic 
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flux but also the statement that 't Hooft's electric and magnetic fluxes get exchanged under 
S transformation. This means in particular that the transformation laws are 

Z u (-l/r) = ±N->»/* Q " J>p(^^(r) 

V 

Z u (-l/r) = ±iV-^/ 2 ^exp(^p)Z„(r). (3.15) 

V 

The proportionality constant was fixed by requiring S = 1, which leaves a ± sign ambi- 
guity in fixing S. The proof that S 2 = 1 requires using the fact that by Poincare duality 
the cup product on the integral lattice H 2 (M, Z) is self-dual. In particular one uses 

£exp(^^)=iV^, (3.16) 

V 

The partition function for the SU(N) theory is the same as the contribution of zero 
magnetic flux to the SU(N) /Zn partition function, times an elementary factor: 

Zsu(N) = N- 1+b ^Z Q (3.17) 

Here b\ is the first Betti number of M. The prefactor in (|3.17| ) arises as follows. The volume 
of the group of SU(N) gauge transformations is (i) bigger than that of SU(N)/Zn by a 
factor of N because constant gauge transformations by an element of the center of SU(N) 
are non-trivial in SU(N) but trivial in SU(N)/Zn, but (ii) smaller by a factor of N bl 
because an SU(N) /Zjy gauge transformation, in traversing a loop in M, may make a non- 
trivial loop in tti(SU(N)/Zn), a possibility that is absent for SU(N). (For simplicity we 
assume that Hi(M, Z) is torsion-free or at least that the torsion is prime to N, so that the 
ordinary first Betti number b\ enters this assertion.) So the volume of the gauge group for 
SU(N) is iV 1-61 times that of SU(N)/Zn, and the SU(N) partition function is obtained 
by dividing by this factor the contribution Zq of bundles with zero flux to the SU (N)/Zn 
partition function. 

In the SU(N) /Z^ theory we have to sum over all allowed bundles with equal weight, 
which means summing over all allowed magnetic flux vectors. So 

Z SU(N)/Z N =22 Z v 

v 
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Thus, the original Montonen-Olive relation between SU(N) and SU{N)/Z^, with some 
slight correction factors, is a consequence of (|3.15|) : 

w 

Z S U(N)(-l/r) = ±N~ 1+b ^ (Jr) 2 Z su(n)/Zn (t) 

w 

= ±N~^ 2 (~) 2 Z su(n)/Zn (r) (3.18) 

The Third Cusp 

Using ( |3.15| ), we can determine the meaning of the third cusp for the SU (2) or SO (3) 
theory. The SX(2, Z) transformation ST 2 S takes the SU{2) cusp to the other cusp. Since 
the partition function at the SU(2) cusp is proportional to Z , applying ST 2 S to it with 
the help of ( |3.15| ) and ( B.14|) (and setting iV = 2) we get 

Z ° - i J2(- i y <V+U)z - = i E(- 1 ) t " (U ' 2(M)+U) ^ = (3-19) 

v,u u,v 

Here we have used the Wu formula (see for a quick proof in the simply-connected case) 
which asserts that v 2 = v ■ ui2(M) mod 2 for every vector v G H 2 (M, Z2), where W2(M) 
is the second Stiefel- Whitney class of M. Thus the expansion at the third cusp gives the 
partition function with v = W2(M). 

An Interesting Self-Dual Group 

The transformation property ( |3.15| ) can be in part tested (and was originally guessed) 
by the following considerations. Consider the group 

SU(N) x SU(N) 

where the Zn is embedded diagonally in the product of the centers of the two SU(Nys. 
This is a self-dual group. Its weight lattice is the sublattice of the product of the two 
SU(N) weight lattices given by the condition that the difference of weights is in the root 
lattice; this is a self-dual lattice. Note that since the Z^r is embedded diagonally in 
SU(N) x SU(N), an (SU(N) x SU{N))/Z N bundle is an (SU(N)/Z N ) 2 bundle such that 
the two magnetic flux vectors are equal. Allowing for the possibility of different coupling 
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constants for the two S77(iV)'s, the partition function of the SU(N) x SU(N)/Z N theory 



is 



z su(N)xSU(N)/z N = N 1+bl ^ Z v {q 1 )Z v (q 2 ) 

v 

where we have denoted the instanton counting parameters as gi and q 2 - (The prefactor 
has the same origin as in ( |3.17| ).) Since the group is self-dual, S'-duality says that Z 
should be invariant (up to a factor associated with the modular weight) under simultaneous 
transformations T\ — > — 1/rj for i = 1,2. In order to have a nice action also for T, it is 
convenient to choose r 2 =T\. This in particular implies that the 6 angles have opposite 
signs and therefore the fractionality of the instanton number ( |3.9j ) cancels between the 
two groups; so the partition function is invariant under r — > r + 1. With this choice 
the partition function is therefore SL(2, Z)-invariant up to a factor involving the modular 
weight; in what follows we divide by a power of \rj\ 2w to remove this factor. The partition 
function so corrected is 

Z SU(N)xSU(N)/Z N = N~ 1+bl ^2 \ Z v(q)\ 2 

v 

So we learn that the "partition functions" Z v transform as a unitary representation of 
SL(2, Z). This is a consequence of equations ( |3.14| ) and (|3.15| ), as we will verify below; 
that is an important check on our ansatz. 



Analogy With Rational Conformal Field Theory 

This structure is reminiscent of rational conformal field theory, with the Z v playing 
the role of the conformal blocks. To pursue the analogy, consider for simplicity the case 
that H 2 (M, Z) is one dimensional with the lattice generated by a vector e with e 2 = —1. 
(This situation arises for CP 2 with the opposite of the usual complex orientation or in 
connection with the blow-up of a point on a complex surface.) Then the partition functions 
are Z re for r = 0, N — 1. 

This is reminiscent of the partition function of a two-dimensionsal rational conformal 
theory with iV blocks. In fact the T transformation (|3.14j ) in this case is exactly the same 
as for the SU(N) level 1 WZW model; this model has one block for each conjugacy class of 
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SU(N) and the conformal weight for each one is given (mod 1) by w 2 /2 where w is a weight 
vector in that conjugacy class - precisely the quantity appearing in ( |3.12| ). Therefore they 
have the same T matrix (up to at most an overall phase coming from c). Meanwhile, S is 
strongly constrained from the condition that (ST) 3 = 1 and in fact the transformation law 
under S in (|3.15| ) coincides (up to an overall sign) with that of the SU (N) WZW theory 



at level 1. 

This does not necessarily imply that the Z re 's equal the characters of SU(N) at level 
1, but only that they transform the same way. In §4, we will see that in a particular case 
- involving blow-ups and G = SU (2) - these functions do coincide. 

Verification Of Modular Behavior 

We now return to the general case of an arbitrary four-manifold M (with no N- 
torsion in H 2 (M, Z), for simplicity). We wish to verify that the formulas (|3.14j ) and 
(j3.15|) give a unitary representation of SL(2, Z). The non-trivial point is to verify that 
(ST) 3 = 1. (Given the known structure of T, this equation highly constrains S and under 
some assumptions on M uniquely determines it; in that sense our ansatz in ( |3.15| ) can 
nearly be derived from conventional S'-duality.) 

In verifying that (ST) 3 = 1, we will also find some interesting restrictions on c. In 
particular we will find that c = (N — l)x mod 4 for SU(N). Later by studying certain 
examples we will prove that for SU(2), c = x thus suggesting that the SU(N) answer 
is also c = (N — l)x- The same mod 4 condition can be shown for simply laced groups 
with (N — 1) replaced by the rank of the group. For non-simply laced group G, it is 
tempting given the analogy with conformal field theory to conjecture that c = c±(G)x 
where ci(G) = dimG/(l + h(G)) is the central charge of the WZW theory with target G 
at level 1 (h(G) is the dual Coxeter number of G). 

The {Z v }, form an N bz dimensional representation of SL(2, Z) that is described explic- 
itly in ( |3.14| ) and ( |3.15|) . One can use those formulas to check the statement (ST) 3 = 1. 
When H 2 (M, Z) is one-dimensional the fact that the partition functions transform the 
same way as the characters of SU(N) at level 1 implies that (ST) 3 = 1 (with a suitable 
choice of c). It is easy to generalize this to the arbitrary case. Using the self-duality of 
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H 2 (M : Z) (and in particular using (|3.16|) ) we find that (ST) 3 = 1 holds up to an overall 
factor, which disappears if 

(exp(27rzc/24)) 3 = ±N~ b ^ 2 ^(-1)^^ (3.20) 

V 

where uj = exp(27ri / TV) . We will refer to the right hand side of ( |3 . 2 0|) (with the + sign) 
as Ajsr{L). An(L) depends on N and on the lattice L = H 2 (M,Z) (with its canonical 
quadratic form). We would like to compute An(L) for an arbitrary self-dual lattice L and 
verify ( |3.20| ). It is easy to prove the following properties for A^(L): 

\A N (L)\ = 1 



A N (-L)=A N (L) 

A N (L 1 ®L 2 ) = A N (L 1 )A N (L 2 ). (3.21) 

Here — L denotes the same lattice lattice as L except with the opposite sign for the 
quadratic form. Let denote the one dimensional lattices generated by a vector v with 
v 2 = ±1. Then [^9j every indefinite odd self-dual lattice!! is a direct sum of the form 



r + I + © r_J 
So we learn from ( |3.21| ) that for such lattices 

a n (l) = A N (i+y+A N (i-y- = A N {i + y+- r - = A N (i+y^ (3.22) 

where a = r + — r_ is the signature of the lattice. Moreover, it is known E| that every 
self-dual lattice can be transformed to an odd, indefinite self-dual lattice by taking a direct 
sum, if necessary, with I + or I~ . Therefore using ( |3.21| ) we learn that ( J3.22[ ) is true for all 
lattices, and we are left just with computing An(I + ): 

N-l 

O,' 



1 ^-1 

A N (I+) = — ^(-l) s exp(2 7 rz S 2 /2iV) 



s=0 



24 How to compute such sums is explained in [49], as was pointed out to us by Dick Gross. 

25 An odd lattice is simply a lattice such that v ■ v is odd for some v. 
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This sum can be computed using the Poisson resummation technique (the sum is a simple 
generalization of so-called "Gauss sums" (see appendix 4 of @)). We learn 



which implies 



and so 



, / +x -2m(N - 1) . 
A N (I+) = exp K - >- (3.23) 



2nic -2m(N-l)a 
exp — = ±exp K — (3.24) 



c = — {N — l)a + 4e mod 8 
where e = 0, 1 depending on the ± in ( |3.24[ ). We can rewrite c as 

c = (N — l)x — (N — + cr) + 4e mod 8 

Note that if N is odd, then N — 1 is even; and since x + ° = mod 2, (N — l)(x + 
a) = mod 4. For iV even, the partition function vanishes unless x + a = mod 4 
(otherwise instanton moduli space is odd dimensional and its Euler characteristic in the 
relevant sense is zero). Thus in either case for a non-trivial partition function we have 
(N - 1) (x + cr) = mod 4, and so 

c = (N — l)x mod 4 (3.25) 

We will see in §4 that at least for N = 2, this equation for c is true identically and 
not just mod 4. This will in particular imply 

« = (JV -^ + a) mod 2 (3.26) 

Note that we have thus determined the ± sign in ( |3.15|) to be (— l) e . In the case of SU(2) 
note that (— l) e = (— 1)" where v = We will use this fact later on. 



4. Testing the Predictions of S'-duality 

In this section we will test the predictions made in the previous section in a small but 
satisfying set of examples. The examples we consider, mostly with gauge group SU (2), are, 
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Ill turn, K3, CP 2 , blow ups of Kahler manifolds, and ALE spaces. These examples not only 
provide strong coupling tests of 5-duality but also fix the quantities c and w which were not 
completely fixed in §3 to be for 577(2) c = x an d w = — x- We will also find a little bit of a 
surprise in the context of our discussion of CP 2 . We find that the 5-duality predicts that 
there are holomorphic anomalies (a dependence of the partition function on topologically 
trivial observables) in the topologically twisted N = 4 super Yang-Mills theory. These 
holomorphic anomalies are somewhat reminiscent of holomorphic anomalies that arise for 
certain two-dimensional topological theories [33] . There also may be a relation to a sort of 



anomaly that arises in Donaldson theory on certain manifolds. Both in the case of blowups 
and for ALE spaces, the analogy with rational conformal field theory that we have seen 
already will reappear. 

4.1. N = 4 Yang-Mills on K3 

For our first example, we study the partition function of N = 4 Yang-Mills theory 
on K3. This turns out to be computable because of the existence of nice constructions of 
instantons on K3. It is also a particularly nice example because - as it is hyper-Kahler 
- the physical model coincides with the topological model if one orients K3 correctly (so 
that holomorphic vector bundles are instantons). Furthermore, the vanishing theorem of 
§2 applies to K3, so we can expect to compute just in terms of instantons. 

In general, the dimension of the 577(2) instanton moduli space A4f~ of instanton num- 
ber k is 

3 

dmxA/U = 8k - -(x + o") (4.1) 

where x an d cr are the Euler characteristic and the signature of the manifold respectively. 
For K3, one has x = 24 and a = —16, so 

dimMk = 8k - 12 

If E is an 577(2) instanton bundle on M = K3 with instanton number k, one can 
seek [p3| - p4| a convenient description of E by finding a line bundle L on K3 such that the 



index of the d operator coupled to L 1 <g> E is l.t^l It will then be generically true that 



26 These matters were explained to us by P. Kronheimer. 
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H°(M, L -1 ® E 1 ) has (up to scalar multiple) a single holomorphic section s. The number 
of zeroes of s, counted with multiplicity, is equal to the second Chern class of L -1 <g> E, 
which (with L and E as stated) turns out to be 2k — 3. Thus, if L exists, one has a natural 
way to extract from E a configuration of 2k — 3 points on K3, namely the zeroes of s. 
Conversely (when an appropriate L exists), given a configuration of 2k — 3 points on K3, 
one can reconstruct a unique E, using a process of extension of sheaves due originally to 
Serre. 

A configuration of 2k — 3 points on the four-manifold M depends on 8k — 12 real 
parameters. This number equals the dimension of instanton moduli space as given above. 
In fact it is true in this situation that the instanton moduli space can be identified with 
the space of configurations of 2k — 3 distinct (but unordered) points on KS, and is, in 
particular, hyper-Kahler. If the points are permitted to coincide, one gets a compact hyper- 
Kahler manifold with orbifold singularities. A resolution of the singularities of that moduli 
space, preserving the hyper-Kahler structure, gives the algebrogeometric compactification 
of instanton moduli space. Such resolutions all have the same Betti numbers and Euler 
characteristic and can be studied by standard orbifold methods that we use below. 

A line bundle L with the properties needed for the above construction exists if and 
only if k is odd and a suitable complex structure is picked on K3. (For different fc's one 
needs different complex structures.) The restriction on the complex structure will not 
cause difficulties, because the partition function we are trying to compute is independent 
of the complex structure. The restriction to odd k will result in the appearance of an extra 
function below. 

The construction just sketched can also be carried out for 50(3) bundles with non- 
zero W2- In this case, a suitable L always exists (if the complex structure on K3 is 
suitably chosen). The instanton number k might be half-integral (K3 is a spin manifold 
so a denominator of 4 cannot arise), but 2k — 3 is always integral and the moduli spaces 
with W2 7^ have the Euler characteristic of (a hyper-Kahler resolution of) the symmetric 
product of 2k — 3 copies of K3. 

Even though we do not have any convenient description of the moduli spaces with 
W2 = and even k, we will be able from the facts stated above both to test >S-duality and 
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to use it to predict the Euler characteristics in the missing cases. 



SO(3) Bundles On K3 

We will need to know some facts about SO (3) bundles on K3. Of course, in addition 
to the instanton number, such a bundle E is classified topologically by the "magnetic flux" 
v = W2(E). As H 2 (K3, Z) is 22 dimensional (and torsion- free) , v can take 2 22 values. In 
the 50(3) theory, we must sum over them. 

There is no need to study separately 2 22 possibilities because K3 has a very large 
diffeomorphism group which permutes the possible values of v. One obvious diffeomor- 
phism invariant of v is the value of v 2 modulo 4; if it is we call v even and if it is 2 we 
call v odd. If v is odd, it is certainly non-zero, but for v even there is one more obvious 
invariant: whether v is zero or not. It turns out that up to diffeomorphism, the invariants 
just stated are the only invariants of v. So on K3 there are really three partition functions 
to compute, namely the partition functions for v = 0, v even but non-zero, and v odd. We 
call these Zq, Z even , and Z a dd- Similarly, we write no, n even , and n Q dd for the number of 
values of v that are, respectively, trivial, even but non-trivial, and odd. 

Let us count how many u's of each type there are. It turns out that for this purpose, 
the intersection form on H 2 {K3, Z) can be replaced by the sum of 11 copies of 

--(;;). <-) 

This makes the combinatorics of counting the different kinds of bundle straightforward 
- in fact, equivalent to the combinatorics of counting the number of even and odd spin 
structure on a Riemann surface of genus 11, except that one must separate out the case of 
zero flux. The result is 

no = 1 trivial type 
2 22 4- 2 11 

rieven = 1 even type 

222 _ 211 
n dd = odd type 

So in particular the SU(2) and 50(3) answers are 

Zsu{2) = 
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ZsO(3) — + n evenZ e ven + ^oddZodd 

(The factor of 1/2 in the first equation comes from the factor of 1/#Z(G) in (|3.3| ).) 



Euler Characteristic Of A Symmetric Product 

According to the description sketched above of the instanton moduli spaces on K3, 
we need to calculate the Euler characteristic of a symmetric product of -O's, that is, of 
the quotient (K3 x K3 x . . . x K3)/S n of the product of n copies of K3 by the group S n of 
permutations of n objects; S n acts by permuting the factors. The hyper-Kahler condition 
means that one can use the orbifold description |5l[] of the cohomology of the quotient of 



a manifold by a finite group. The result we need has been computed before [[52] and has 
even been obtained using the orbifold formula [p3 |. However, we will include a derivation 



for completeness. It is actually convenient, and no more difficult, to apply the orbifold 
formula to the symmetric product of an arbitrary manifold M (the orbifold formula is 
only expected to agree with the cohomology of a resolution of the symmetric product if 
M is hyper-Kahler or at least Calabi-Yau). In the computation, we will aim to describe 
the cohomology of the symmetric product, and not just compute the Euler characteristic. 
This will make the meaning of the result clearer. 

First we recall the general construction of the cohomology, in the orbifold sense, of a 
quotient X/Y, with X a manifold acted on by a discrete group V. It is constructed as 

H*(X/Y) = ® 1 H 1 (4.3) 

where 7 runs over conjugacy classes in Y (for each conjugacy class we pick a representative 
that we also call 7) and if 7 is the cohomology in the sector "twisted" by 7. This is obtained 
as follows. Let X 1 be the subset of X left fixed by 7. Let N 1 be the subgroup of Y that 
commutes with 7. Let H*(X 1 ) be the cohomology of X 1 and let H*(X J ) N ^ be the part 
of H*(X J ) that is invariant under iV 7 . Then 

H n = H*(X~*) N ~<. (4.4) 

In the case at hand, X is the product M n = MxMx...M of n copies of M, and 
T is the group S n of permutations of the factors. We also want to take the sum over n. It 
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is convenient, as we will see, to introduce an operator Lq that acts on H*(M n /S n ) with 
eigenvalue n, and to write the sum over n as 



H = ®^ =0 q n H%M n /S n ) (4.5) 

where the powers of q are a formal way to keep track of the Lq action. 7i will turn out to 
be a kind of Fock space. 

First let us work out using the definition in Ql4|) the contribution of the untwisted 
sector. For 7 = 1, M 1 = M n . The cohomology of M" is H*(M n ) = H*(M)® n , the tensor 
product of n copies of the cohomology of M. Also, N\ = S n , the full permutation group. 
So Hi is the S^-invariant part of (H* (M))® n . This can be interpreted as follows. If we 
think of an element of H*(M) as a "one particle state" and an element of H*{M)® n as 
an "n particle state," then S n invariance means that we should think of the n particles as 
being identical bosons or fermionsil and impose bose and fermi statistics. If therefore we 
pick a basis w a of the cohomology of M, and introduce a corresponding set of "creation 
operators" a a _ x acting on a "Fock vacuum" then the contribution of the untwisted 
sectors to ( P~5| ) is a Fock space generated by the a a L 1 . 

We now want to show that the inclusion of twisted sectors results instead in a Fock 
space generated by a "stringy" set of oscillators a°_ n n = 1, 2, 3, ... , which have L = n 
just as one would expect in string theory. We have to remember that every element of S n 
can be decomposed in disjoint cycles, for instance a permutation of six objects might take 
the form of a one-cycle, a two-cycle, and a three-cycle, often denoted (1) (23) (456). The 
conjugacy classes in S n are labeled by giving the number ni of /-cycles for / = 1, 2, 3 . . .. 
The ni are arbitrary non-negative integers except for the obvious restriction 



n 

1 



!>'■ (4-6) 



iV 7 is of the form 



iV 7 = J]iV 7 (/) (4.7) 



1=1 



27 That is, the cohomology classes of even and odd dimension correspond to bosons and fermions 
respectively. 
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where N 1 (l) is the subgroup of iV 7 consisting of permutations that act non-trivially only 
on objects in the /-cycles of 7. This factorization, which holds because the order of an 
object under 7 is invariant under iV 7 , is reflected in a similar factorization of 7i. 

It is convenient to consider first the contributions of /-cycles of a fixed /. Suppose that 
7 consists only of /-cycles, say k of them. Then of course n = kl, and M n is a product 
of n copies of M, divided into k sets of / such copies; each set is permuted cyclically by 
7. The fixed point set (M n ) 1 is a product of k copies of M, one for each /-cycle, so its 
cohomology is H* \M)® k . iV 7 again acts on H*(M)® k by permuting the factors, and again 
this means that if we think of the elements of H*(M) as "one particle states," then we 
should impose bose and fermi statistics on the u k particle states" in H*(M)® k . Also, we 
want to sum over k = 0, 1, 2 .... So if we introduce a new set of creation operators at t , 
then the contribution to 7i from conjugacy classes consisting of /-cycles only is the Fock 
space generated by the at L acting on the Fock vacuum. There is one novelty: at x has 
Lq = I because of the relation n = kl, or if you will because taking the fixed points of 7 
collapsed all the copies of M in an / cycle into a single copy, the diagonal. 

Now it is easy to put the results together. Once we sum over n the restriction in ( }4.6| ) 
loses its force and the ni are independent. The fixed point set of a general permutation 
7 contains a factor of M for each /-cycle in 7 of any /. So the cohomology H*(M™) has 
a factor of H*(M) for each cycle. The symmetry group iV 7 acts separately, according 
to (|4.7| ), on the cycles of different order. So bose and fermi statistics are only imposed 
on "particles" of the same /. The net effect of this is that if we introduce the creation 
operators at n for all positive integers n, then H is the Fock space generated by the at n , 
as claimed above. 

For instance, if 6+ and b- are the dimensions of the bosonic and fermionic subspaces 
of H*(M), then we can write a simple result for the generating functional of the number 
of states, 



We can also write a simple result for the generating function of the Euler characteristics, 




(4.8) 




(4.9) 
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Happily, up to elementary factors, those formulas are modular. In particular the generating 
function for the Euler characteristic after multiplying by q~ x ^ 24 becomes 

G(q) = <r x/24 9"X(M7S,) = \ (4.10) 

where rj is the Dedekind eta-function. 

Instantons On K3 

Now we return to the computation of Euler characteristic of instantons on K3. For 
K3, equation ( f4.10|) gives 

G{q) = -4' 

which is none other than the partition function of left-moving modes of the bosonic string! 

We can now calculate the Euler characteristic of the moduli spaces of instantons on 
K3. It is easier to consider first the SO (3) bundles with nonzero W2 because then the 
description by a symmetric product of -fT3's work for all values of the instanton number. 
We recall that if the instanton number is k, then the number of points is n = 2k — 3, so 
for even (or odd) bundles only odd (or even) n contributes. Also the formula n = 2k — 3 
shows that adding another copy of K3 adds only 1/2 to the instanton number, so if we 
want to count instantons with powers of q, we must count points on K3 with powers of 
g 1 / 2 . So we can evaluate the partition functions 

Z = q- s J2l n x(M n ). (4.11) 

n 

If we pick s = 2, we get 

Z even (q) = \(G(q 1/2 ) + G(-q 1 / 2 )) 

1 V J (4.12) 

ZoaM = \ (G(qV*) - G{-qV*j) . 

These formulas are modular (of weight —12), giving finally our first strong coupling test 
of S'-duality. In writing (|4.12|) , we added or subtracted G(q 1 ' 2 ) and G(— g 1//2 ) to project 
onto terms with an even or odd number of copies of K3. 
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The formula dim.M/ c = 8k — 12 for the dimension of instanton moduli space shows 
that the smallest value of k for which Aik is non-emptyi! is k = 3/2. The leading behavior 
Z dd ~ q~ 1 / 2 of Z dd comes from an instanton of k = 3/2; its contribution is shifted from 
q 3 / 2 to q~ x l 2 by the overall factor of q~ 2 that comes from picking s = 2. Similarly, the 
leading behavior Z even ~ 1 of Z even is the shifted contribution from k = 2. 

The contribution of the trivial £0(3) bundle on can be treated similarly, with one 
difference: since the description of the moduli spaces by symmetric products of K3 is in 
this case only valid for odd instanton number, we have to add an unknown function F(q 2 ) 
contributing to the terms with even instanton number. The partition function is thus of 
the form 

Zo(q) = F(q 2 ) + \{G{q 1 ' 2 ) + Gi-q 1 / 2 )) (4.13) 

where we have multiplied by the same factor of q~ 2 . 

Now we will make a precise test of S'-duality in its sharpened form of equation ( |3.15| ) . 
This predicts the transformation law under r -> -1/r to be 

a/2 



-li 



2 

i 



T 



J SO(3) 



11 /T\ I io 10 

Zeven ~^ 2 y-rj \Zq + (2 — l)Z even — 2 Z dd\ 

Zodd - 2" 11 ( T ) [Z + (-2 10 - l)Z even + 2 10 Z odd \. (4.14) 
To obtain these formulas from ( |3.15| ), one needs to evaluate, for given i>, the sums 

u odd u even 

This is easily done using the fact that modulo two the intersection form on H 2 (K3, Z) is 
equivalent to 11 copies of H, and picking a convenient choice of v. 

Since we have incomplete information about Zq, we can eliminate it from the above 
to get the prediction that under r — > —1/r, 

7 —7 — ^ ( — \ w / 2 (7 —7 \ 
^even ^odd \ . ) \^even ^odd) 

I 



28 Generically, and with an exception discussed below. 
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which is indeed the case 

Z even - Z odd = Gi-q 1 / 2 ) - r-^Gi-q 1 ' 2 ). 

In particular, the modular weight is —12, and w = —24. But we can go further and 
determine F(q 2 ). Using the modular transformation properties 

G(-q 1/2 ) - r- 12 G(- ? 1 /2) 
G ( ? i/2) _ 2 - 12 r- 12 G(( / 2 ) 

G ( 5 2) ^ 2 12 r -12 G(? l/2 ) 



we find that all of the equations ( |4.14| ) are satisfied if and only if 



F(q 2 ) = \G{q 2 ) 

That there exists an F which satisfies all three equations at the same time is a very precise 
further test of 5-duality. 

Note that the leading behavior of F(q 2 ) for small q is F(q 2 ) ~ q~ 2 + constant. Thus 
the partition function Zq for zero flux has an expansion 

Z ~^ + O(l). (4.16) 

The contribution of order q~ 2 must be interpreted as the contribution of the trivial con- 
nection, shifted from q° to q~ 2 because we have multiplied the whole series by q~ 2 . 
There is indeed one exception to the statement that instantons generically do not ex- 
ist for 8k — 12 < 0: the trivial connection always exists, even though one might think 
that generically it shouldn't. Evidently, it contributes to the twisted N = 4 theory, and 
(for SO (3)) its contribution has the somewhat mysterious value 1/4. On the other hand, 
8A; — 12 is negative for k = 1, and for k = 1 there are in fact generically no instantons. 
That explains the absence of a term of order q~ x in ( |4. 16| ) . 

Of course, we can reassemble our results into formulas for the 577(2) and 50(3) 
partition functions. The SU(2) partition function, allowing for the usual factor of 1/2, is 



Z 



SU{2) 



\G{q 2 ) + -Mq 1 ' 2 ) + -Gi-q 1 ' 2 ) (4.17) 



4 
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This transforms as a modular form of To (2), as it should, since it is invariant under T 
and transforms correctly under ST 2 S. This formula makes predictions for the Euler char- 
acteristics of the moduli spaces with W2 = and even instanton number; the predictions 
contain mysterious denominators of 1/4 which must somehow reflect the singularities of 
these moduli spaces. The 50(3) formula is similarly 

Zso(3) = \G{q 2 ) + 2 21 G{q 1 ' 2 ) + 2 10 G(- ? 1 /2). (4 . 18) 

and again has modular properties for To (2). 

Since w = —24 and s = —2, the "central charge" defined in ( |3.7| ) is c = 24. This agrees 
with the general arguments discussed in the last section ( |3.25| ) which gave c = x = 24 mod 
4. We see that the equality is true even if we remove the mod 4 condition. We also see 
that at least in this case w = —\ = —24. Since in general c and w are linear combinations 
of x an d or we need one other example with a different ratio of x an d cr to completely fix c 
and w in general; we will find that the coefficients of a vanish (as one might have expected 
from parity conservation were it not that parity is explicitly broken by the twisting used 
to construct a topological field theory). 

Before leaving K?> 1 we will point out a fairly natural guess for the partition function 
for the SU(N) model on K3. We will state the guess for the partition function Zq of 
bundles with zero magnetic flux. Contributions of the other bundles are determined by 
modular transformations. Our guess is 

G(q l / N ) + G(ujq 1/N ) + ... + G(uj N ~ 1 q 1/N ) 

where u = exp(27rz/iV). The formula has some attractive properties (but considerations of 
its modular properties as well as some considerations explained at the end of §5.3 suggest 
that it may be valid, if at all, only for iV prime). For SU(N) the dimension of instanton 
moduli space on K3 is 4kN — 4(iV 2 — 1), so after the trivial instanton at instanton number 
0, instantons appear first at k = N. That agrees with the fact that in the above formula 
Zq ~ q~ N /N 2 + O(l) with a gap between q~ N and q°. The modular transform of the 
formula for Zq can also be seen to give the right gaps for bundles with non-zero flux. 
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The above guess suggests that the instanton moduli spaces for integral instanton 
numbers that are not mod N (or perhaps only those prime to N, in view of what we 
find in §5.3), can be identified with appropriate symmetric products of i^3's. At least 
for instanton numbers 1 mod iV there are some indications of this ||54j| . Our guess is also 
compatible with c = (N — l)x in accord with (|3.25| ), and with w = —\ (in other words 
this guess suggests that the modular weight is independent of the gauge group for a simple 
group; this is also supported by the analysis discussed below on ALE spaces). 

4.2. CP 2 

We now consider the case of CP 2 , again with gauge group SU{2) or SO{3). CP 2 is 
another case in which the vanishing theorems of §2 apply, so it would seem that we can 
compute from instantons only. We will actually run into a surprise, a kind of anomaly that 
affects holomorphy. 

As i7 2 (CP 2 ) is one-dimensional, an SO (3) bundle E has two possible values of v = 
u>2(E). There is v = 0, with v 2 = 0; and there is v ^ 0, with v 2 = — 1 modulo 4. 
Accordingly, there are two partition functions, 

Z = q- s J2x(Mo,n)q n 

(4.19) 

Z 1 = q- s Y,x(M 1>n )q n , 

n 

where -Mo,n and M.i >n are, respectively, moduli spaces of bundles with v = and v ^ 0, 
and instanton number n. To study these functions, we rely on formulas of Yoshioka 
and Klyachko |2^,^8|. Yoshioka determined a formula for Z±, while Klyachko determined 



formulas for the related functions 



3C 



Yi(q)=q- s Y,x(M^i, n )q n , (4.20) 

n=0 

where Ai^\ : n is the uncompactified moduli space of instantons of instanton number n and 
magnetic flux determined by i. 
Klyachko' s formula for Y\ is 

oo 

Y 1 {q) = 3^H{An-l)q n - 1 ^ (4.21) 

n=l 
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where H{m) is the Hurwitz function that equals the number of equivalence classes of 
integral binary quadratic forms of discriminant m, weighted by their number of automor- 
phisms.il A formula of YoshiokaH shows that for any Kahler manifold of Euler character- 
istic x, if we take s = — x/12 then 

Y v (q) 



Zv(q) 



(4.22) 



v(q) 2x 

for appropriate non-zero v; the formula applies for v ^ on CP 2 . (The fact that the 
modular function r\ appears here should hopefully be related to S'-duality, though we do 
not know how.) Combining these formulas, one has 

3 x 



Zi(q) 



^2H(4n-l)q r 



(4.23) 



(Yoskioka also obtained directly |2f| a formula for Z\ which must coincide with this formula 
- we checked this for the first few terms - but the modular properties are easier to see in 
( |4.23|) .) Since the modular properties of ( [4. 23|) , which will be discussed presently, would 
be ruined by multiplying by a power of g, the value of s is the one required to make the 
formula Q4.22 ) modular, or 

s = (4.24) 

which is the same result that we found for K3. Similarly, as the modular weight of Z\ is 
—3/2 (in a suitable sense that will be explained) we can determine in general (using also 
the K3 result) that the modular weight is — %/2 or that 



w 



-X- 



(4.25) 



We will have a further check on ( 4.24 ) and Q4.25 ) when we consider blowups. 



Klyachko's formula for Yq has an extra term relative to (|4.21[ ), but Yoskioka's analog 
of (|4.22|) for v = is also more complicatediil; combining them, it appears that Zq is the 



obvious analog of ( 4.23 ): 



' n=0 



(4.26) 



29 H(m) vanishes unless m is congruent to or —1 modulo 4. 

30 This follows from the Weil conjectures and Theorem 0.4 of [26]. 
31 



This formula is unpublished and we are grateful to Yoshioka for providing it to us. 
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The series in ( |4.23| ) and ( [4.26 ) are known as Eisenstein series of weight 3/2. For a relatively 



elementary introduction to such series see |47|], section IV. 2, especially p. 194; see also ||55]| . 



The simplest Eisenstein series of half-integral weight for To (4) are defined by series such 



as 



E k/2 (z) = J2j(l^r k - (4-27) 



7 



Here the sum runs over certain cosets of To (4), and z) k generalizes the factor (roz + 
n)~ k that appears in the conventional Eisenstein series 

G k (z) = J2( mz + n )~ k - ( 4 - 28 ) 



For k an odd integer > 5, (|4.27 ) defines a modular form of weight k/2 for Tq(4). However, 



for k = 3, the sum in (|4.27| ) does not converge quite well enough to define a holomorphic 
modular form. To define a modular object, one regularizes the sum, replacing (|4.27|) with 

E 3/2 ( Zl s) = J2jh^r 3 \j\- 2s - (4-29) 

7 

With this regularization, E^/ 2 {z,s) transforms as a modular form of weight 3/2 (for any 
fixed s) but is not holomorphic in z. One can actually take the limit s — > 0; the function 
^3/2(2) = E^/ 2 (z, 0) that one obtains this way is modular but not holomorphic. The story 
is just analogous for the ordinary Eisenstein series Q4.29Q for k = 2, which similarly does 
not quite converge well enough to define a holomorphic modular form. 

As was discovered twenty years ago by Zagier |32| (see also [^3| ), the process just 
described, starting with a slightly different Eisenstein series of weight 3/2, gives such 
functions as 



f = ZH(4n)q n + 6r 2 " 1 /2 ^ (3(47vn 2 r 2 )q- n2 

n>0 nEZ 
n>0 neZ 

where q = e 27riT , t 2 = Im(r), and 



I) 2 

2 > 



(4.30) 



Pit) = -7T- I u~ 3/2 exp(-ut) du 
16tv 
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Thus, the modular status of ( |4.23|) and ( 4.26 ) is clear: these functions are the "holomorphic 



part" of the non- holomorphic modular functions in ( |4.30|) (divided by rf). The "holomor- 
phic part" can be more formally defined by considering the limit of r — > oo while keeping 
r fixed. It is easy to see that the additional non-holomorphic terms vanish in this limit. 
Moreoverll under r — > — 1/r, these functions transform as 

This is exactly the transformation law predicted in ( |3.15| ). Since fo is invariant under T 
and fi under T 4 , it follows, for instance, that fo has the expected modular transformation 
law under the group To (4) generated by T and ST 4 S. 

Holomorphic Anomaly? 

Thus, all is well except that the Z v are not holomorphic. This is reminiscent of the 
holomorphic anomaly in certain topological two-dimensional models |34j. The situation 



there is as follows: One has an untwisted physical theory with coupling constants r and 
r, parametrizing some moduli space, appearing in the action. In the physical theory r is 
the complex conjugate of r. One then considers the twisted version and shows that the 
t dependence in the action is of the form r{Q, ...}, and so formally the path integral is 
independent of r . One then takes the unphysical limit r — > oo keeping r fixed to argue that 
the twisted theory computes the Euler characteristic of a certain moduli space. However, 
under certain circumstances p£|| , the formal independence of the path integral from r 



is anomalous and the twisted theory has nice modular properties only when r is chosen 
to be the complex conjugate of r. Nevertheless the topological answer is recovered, and 
modularity lost, by taking r — > oo and keeping r fixed. This is very similar to the above 
situation for the N = 4 twisted Yang-Mills on CP 2 , with the obvious substitutions. In 
particular the S-duality conjecture seems to suggest that the twisted physical theory on 
CP 2 has t dependence. 

Another precedent, perhaps equally relevant, is Donaldson theory which (as Donald- 
son discovered in one of his early papers |56|] ) fails to produce topological invariants on 



32 According to an unpublished argument kindly explained to us by D. Zagier. 
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manifolds with &2 + = 1- CP 2 has &2 + = 1 (though in Donaldson theory it is not a typical 
example with 62 + = 1 since also =0). 

The anomaly in Donaldson theory comes from abelian connections where zero modes 
appear for the fields called <f>, (p in §2. Though this anomaly has not been given its proper 
physical expression, it very plausibly involves the essential failure of compactness of field 
space due to the flat directions in the <f> potential. (Those flat directions are suppressed 
when the gauge field has SU{2) holonomy.) The condition 62 + = 1 for the anomaly looks 
like it might be natural from this point of view, since for 62 + > 1 there are extra fermion 
zero modes along the flat directions. 

The behavior of the twisted N = 4 theory on CP suggests that it too may have 
an anomaly associated with abelian configurations when &2 + = 1- I n fact, the instanton 
numbers of abelian configurations on CP 2 are — n 2 for v = and — (n + 1/2) 2 for v = 1. 
These are precisely the exponents that appear in the anomalous, non-holomorphic sum in 
( |4.30| ) . The failure of holomorphy of the fi can be summarized in the elegant equations 

d^ f °-mH T2 ^ q 

a o nez (4-32) 

df H ~\^i 2 ^ Q 

nez 

which give further encouragement for seeking a field theoretic explanation of the anomaly. 



The formulas of Yoskioka |27| for instantons on rational ruled surfaces (which all have 
62 + = 1) strongly suggest that on this whole class of manifolds there is some sort of 
anomaly generalizing the one that we have found experimentally on CP 2 . 

4.3. Blow Ups 

Our next example will involve the behavior under blowing up a point on a Kahler 
manifold. Thus, we let M be a Kahler manifold, and M the new manifold obtained by 
blowing up a point p G M. Topologically, the effect of the blow-up is to glue in into M a 
copy of CP (that is, CP 2 with the opposite of the usual complex orientation). 

The effect on the second homology is that the lattice L of M is replaced by the lattice 

L = L®I~ (4.33) 
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for M (where / , as in §3, is the one dimensional lattice with quadratic form —x 2 ). If E 
is an SO (3) bundle on M, then v = W2(E) can be decomposed as 

v = v © r (4.34) 

under the decomposition ( |4.33| ). 

In equation ( |3.15|) , we proposed a precise formula for how the partition functions 
transform under ^-duality. Since L is a direct sum, the formula implies that the action 
of SL(2, Z) on Z~ = Z v>r is the product of a representation acting on the v label and a 
representation acting on the r label. Thus, the action of SL{2, Z) is compatible with the 
possibility that the Z's have a factorization as 

Z m^ = Z M,vZ r (4.35) 

where for clarity we write the manifold M or M explicitly, but the function Z r could be 
"universal," independent of M. 

The possible Z r J s can be analyzed just as in §3.3, where we considered a manifold 



with the intersection form I~; if a factorization such as (|4.35|) holds, that discussion can 
be applied to the blowup of an arbitrary M. In that discussion, we pointed out that (for 
SU(N), not just SU(2)) the modular transformations of the Z r are those of the level one 
characters of the SU(N) WZW model. However, the modular transformations alone do 
not imply that a factorization such as Q4.35| ) holds, or if it does that the Z r are the WZW 
characters. 

These assertions are, however, true, at least in one important case, according to 
another formula of Yoshioka (||26|], proposition (0.3)). He demonstrates that under blowup 
of a Kahler manifold, the factorization of ( |4.35| ) holds for SU(2) and r = 0, with 

z o = -r-\2> ( 4 - 36 ) 

where 

Bob) = E^ 2 - ( 4 - 37 ) 

This is indeed 1/rj times the appropriate SU(2) character at level 1. 
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Under blowup, the Euler characteristic of a manifold is increased by 1. On the other 
hand, the function in (|4.36|) behaves for small q as q~ l t 12 and transforms as a modular form 



of weight —1/2. This gives a further check on our earlier finding that the "zero point" of 
the instanton number is — %/12 and the modular weight of the partition function is — x/2. 
5-duality implies that the factorization ( f4.35|) must also hold for r = 1 and that 

where 

■2 



Oi(q) = Yl «" • ( 439 ) 

neZ+l/2 



4-4- ALE Spaces 



For our last example, following Nakajima |[29| , we consider instantons on ALE (asymp- 
totically locally Euclidean) spaces. In many ways this is the richest example, as precise 
information is available for U (k) or SU (k) gauge group, not just SU (2), HI and one encoun- 
ters WZW models at arbitrary levels, not just level 1. Unfortunately, it is hard to exploit 
the examples fully as one does not know enough about the implications of 5-duality on 
non-compact manifolds. 

The ALE spaces X-p we have in mind arise as hyper-Kahler resolutions of hyper-Kahler 
orbifolds of the form C 2 /r, where F is a finite subgroup of 5£7(2) acting linearly on C 2 . 
These spaces are noncompact relatives of K3. Indeed, K3 has a somewhat analogous con- 
struction beginning with the quotient of a four-torus by a finite group, and the singularities 
so obtained have the local structure of C 2 /r. 

Every finite subgroup T of 577(2) has, up to isomorphism, a finite set of irreducible 
complex representations p^. If p is the two-dimensional representation associated with the 
embedding of Y in SU(2), one can decompose the tensor product p ® pt as the direct 
sum of riij copies of pj; the riij are all or 1. Form a graph with the pi for vertices, 
with two vertices pi, pj connected by a line if and only if 7^ 0. It turns out that 



33 The partition function of the U (k) theory would vanish on a compact four-manifold because 
of fermion zero modes in the £7(1) factor, but that is not so on the ALE spaces, where there are 
no normalizable zero modes. Nakajima's results are most elegantly stated for U(k). 
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the graph so obtained is the extended Dynkin diagram of a simply laced Lie group Hp. 
The correspondence V <-> Hp is the McKay correspondence between finite subgroups of 
SU{2) and simply laced Lie groups. (The subgroup Zn of 577(2) corresponds to Ajv-i, 
the dihedral groups correspond to the D n , and the symmetries of the three regular solids 
correspond to the E series.) 

The Dynkin diagram of Hp appears in the geometry of Xp as the intersection form 
of two-spheres Si created in blowing up the singularities of Xp. These two-spheres give a 
basis of H2(Xp, Z), and the first Chern class of a vector bundle over Xp can be interpreted 
as a weight of Hp. 

The hyper-Kahler metrics on Xp are asymptotic at infinity to R 4 /r. The fundamental 
group at infinity is non-trivial, and admits non-trivial homomorphisms 



to the gauge group G. These homomorphisms enter because requiring that the action of an 
instanton should be finite does not imply that the instanton approaches the trivial connec- 
tion at infinity, but only that it should be flat at infinity, that is, given by a homomorphism 
from r to G. 

Nakajima analyzes instantons with gauge group U(k) on Xp. Such an instanton 
determines at infinity a k dimensional representation ( |4.40|) of T, which decomposes as a 
sum of rii copies of the irreducible representation pi, with 



Since the pi can be interpreted as points on the extended Dynkin diagram of Hp, one gets 
a natural assignment of integers rn to the points of the Dynkin diagram. Those points also 
correspond to simple roots of Hp, so we get an assignment of integers to simple roots. 
This enables us to form the sum = J2i n i e ii which is a positive weight of Hp, so it 
determines a representation of Hp. Even better, equation QL41|) is precisely the condition 
that the representation so determined is a highest weight of an integrable representation 
of Hp current algebra at level k. The instanton number of these instantons is shifted from 
being an integer by an amount that depends on (f>. 



(f> :T —> G 



(4.40) 




(4.41) 
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Nakajima now considers the middle dimensional cohomology H^ >Cl ^ n of the moduli 
space of U(k) instantons with a representation (p at infinity, arbitrary first Chern class 
ci, and instanton number n. (n is shifted from an integer by an amount that depends on 
(p.) More precisely, he considers the cohomology of some complete hyper-Kahler manifolds 
obtained by a particular partial compactification and deformation of singularities of the 
moduli spaces. 

To describe the results, introduce an operator L that has eigenvalue n on -ff</,, Cl , n , 
and introduce the sum 



where the variable q is formally included to keep track of the Lq action. Nakajima's 
remarkable result is that 7i has a natural structure of irreducible representation of Hy 
current algebra at level k, with the highest weight being w^. Lq acts by multiplication 
by n, while Ci, interpreted as we mentioned above as a weight of Hy, determines the 
action of the Cartan subalgebra of Hy- The rest of the action of the affine Lie algebra is 
defined through some operations of twisting vector bundles along the S^; these operations 
conceivably should be regarded as analogs of vertex operators in conformal field theory. 
At any rate, the functions 



are therefore the characters of integrable representations of the loop group of Hy- (c is the 
central charge of Hy current algebra at level k.) So they have modular properties and in 
fact transform in a unitary representation of SL(2, Z). Because of not understanding the 
implications of S-duality on manifolds with boundary, and for other reasons mentioned 
presently, we do not know how to explain the particular representation that arises. 

To interpret the facts just sketched in terms of S'-duality, one might hope that the 
middle dimensional cohomology of the moduli spaces coincides with the L 2 cohomology, in 
which case dimif^ Ci n can perhaps be interpreted as a kind of Euler characteristic. Then 
one would hope to deduce the modularity of (|4.43|) from S'-duality. Unfortunately, there 
are difficulties with this interpretation. In particular the partition function of the twisted 



7~L4> = ® Cl ,nQ n H ( j )! 



(4.42) 




(4.43) 
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N = 4 theory gives the Euler characteristic defined as a kind of curvature integral, not the 
Euler characteristic of L 2 cohomology of the moduli space; therefore it is not obvious how 
to compare Nakajima's results to S'-duality. 

Even if one could predict from S-duality the modular properties of the ALE partition 
function, that would not do justice to Nakajima's story, which involves construction of a 
natural Hilbert space with group action and not just a partition function. It does not seem 
that a natural origin of this would be in four-dimensional field theory - which associates 
Hilbert spaces with quantization on a three-manifold, and a partition function with the 
path integral on a four-manifold, but does not normally produce a Hilbert space associated 
with a four-manifold. It would seem that the full story here should have its origins in a 
supersymmetric theory of dimension > 5, which, in quantization on Xp x R (where R is 
the "time" axis) would produce a Hilbert space of physical states. The middle dimensional 
cohomology of the instanton moduli spaces would appear as a set of BPS-saturated states, 
perhaps making it possible to do justice to the results of [^9[ . Note that the dimension of 
the cohomology of the instanton number k moduli space (on K3 or an ALE space) grows 
exponentially like exp(a\/fc) for some constant a. This implies that in a five dimensional 
theory that would be relevant to Nakajima's results, the multiplicity of particles of mass 
m goes as ey^{a\/m) for some constant a. This growth is huge for a field theory though 
it is somewhat less than what one encounters in string theory (whose spectrum grows as 
exp(am) because m 2 = k). A natural origin for such a huge BPS-saturated spectrum may 
well require string theory. Anyway, the only sensible theory that we know of in dimension 
> 5 is string theory, so some of Nakajima's results may have their natural origin in S- 
duality of string theory. 

SU(k) 

It is also of interest to extract from Nakajima's formulas the results for gauge group 
SU(k). For SU(k) on compact manifolds, we worked out precise predictions of S'-duality 
in §3. We would like to see what happens in the ALE case. 

If one wishes to consider SU(k) rather than U(k) on Xp, one must restrict to bundles 
with ci = 0. As the action of the Cartan subalgebra of Hp is proportional to c±, this entails 
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restriction to states annihilated by the maximal torus T. This suggests that the "partition 
functions," in the above sense, of the SU(k) theory would be the characters of the coset 
conformal field theory Hy/T, times the contribution of extra oscillators coming from T 
(which give a non-zero modular weight). This appears to be so, though a verification 
requires a careful study of the (apparently non-standard) way that Nakajima's partial 
compactification affects the U(l) factor. 

Let us specialize to the the ALE space Xy with Y = Zat, so Hy = SU(N). A 
representation (p of the fundamental group at infinity determines a highest weight of 
SU(N) and also a representation A of the loop group of SU(N) at level k. Nakajima's 
result for U(k) means that (with the partition function Z interpreted in his sense) 

Z Hr (r 1 U(k) l( p) = x A (q) 
where x A is the character of the representation A. % A can be written |5jJ 



A 

where the c A , known as "string functions," are characters of the SU(N)/T coset model 
times the partition function of oscillators of T, and the 0a, fc are certain theta functions. 
The A's are as follows: A is an SU(N) weight that is congruent to w$ modulo the root 
lattice L; also one identifies A = A' if A — A' G kL. Thus A takes (N — l) k values. Since 
L can be identified with ^(Ar), A can be identified with the Z^-valued magnetic flux. 
The Zfc magnetic flux and c\ (which is the magnetic flux for the center £7(1) of U(k)) are 
correlated in this way simply because 

= SU(k) x m 

where is a diagonal subgroup of the center of SU(k) and a Z^ subgroup of U(l). 

How does c A transforms under r — > — 1/r? As is well known in the study of coset 
models, the answer is the product of a matrix acting on the A index and a matrix acting 
on A: 

cj(-l/r) = (-r^-^const. £ 6(A, A', A, \')c a ',(t ) (4.44) 

% A', A' 
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where 



6(A, A', A, A') = exp(27rzA ■ X'/k) e M exp(27rz(A + p)w{M + p)/k + N - 1) (4.45) 

wew 

where W is the Weyl group of SU(N) and p is half the sum of positive roots of SU (N) . The 
factor in ( fl45| ) that acts on the the magnetic flux A is consistent with what one expects 
in ( |3.15|) . The factor acting on A, which is determined by the boundary conditions, is 
more mysterious; we do not understand why the modular transformations of the SU(N) 
current algebra appear here. The formula can actually be motivated to a certain extent 
by considering ^-duality for (purely bosonic) U(l) gauge theory on Xr; in that case one 
can see that S'-duality induces a Fourier transform on the boundary conditions, a result 
similar to the large k limit of the A-dependent factor in ( |4.45| ). 



Note also that (|4.44|) tells us that the modular weight is determined by w = — (N — 1), 
which is in some agreement with our earlier formulas as N — 1 is the Euler characteristic of 
the L 2 cohomology of Xr- It is interesting that this value is independent of the gauge group. 
The value of c, however, is fractional now, and deviates from the formula c = (k — l)x that 
we found for compact four-manifolds without boundary. 



5. Computation By Physical Methods 

5.1. Reduction To N = 1 

In this section, by imitating an analogous computation for Donaldson theory ||58|| , we 
will analyze the partition function of the twisted = 4 theory (with gauge group SU(2) 
or 50(3)) on an arbitrary Kahler manifold X with H 2,0 (X) ^ 0. An answer emerges that 
satisfies an impressive set of constraints and gives support therefore to the overall picture 
of ^-duality. 

The key input, mentioned at the end of the introduction to §2, is that on a Kahler 
manifold the theory actually has four topological symmetries, one from each of the four 
underlying supersymmetries. Therefore, a perturbation that breaks the A^ = 4 super- 
symmetry down to N = 1 would still leave us with one topological symmetry, enough 
to control the theory by reducing computations to classical configurations and quantum 
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vacuum states. We can attempt to find an iV = 1-invariant perturbation whose addition 
to the theory brings about some simplification. 

The N = 4 theory can be viewed as an iV = 1 theory with three chiral superfields, 
say T, U, and V, in the adjoint representation of the gauge group. The superpotential is 

W = TtT[U,V]. (5.1) 

For the time being the gauge group is an arbitrary compact simple Lie group G. While 
preserving N = 1 supersymmetry, we could add a perturbation to the superpotential, say 
the mass term AW = — ^mTr(T 2 + U 2 + V 2 ) (or any other nondegenerate quadratic 
expression, which would have essentially the same effect). As in the resulting pertur- 
bation of the Lagrangian is the sum of a term of the form {Q, . . .} plus a BRST-invariant 
operator O of positive ghost number. In Donaldson theory, O has non- vanishing matrix el- 
ements and affects the theory in an important, but calculable, way. A simplification occurs 
for A = 4: the "vacuum" has zero ghost number (as there is no anomaly), so all matrix 
elements of O vanish; hence the partition function is invariant under the perturbation. 

Now let us find the vacuum states of the theory. First we do this classically. The 
superpotential including the perturbation is 

W = --mTv(T 2 + U 2 + V 2 ) +TrT[U,V}. (5.2) 
2 

The equations for a critical point of W are 

[U, V] = mT 

[V,T]=mU (5.3) 
[T, U] = mV. 

These equations have the following meaning: apart from a factor of m, which can be scaled 
out, they are the standard commutation relations of the Lie algebra of SU(2). To complete 
the determination of the classical vacuum states, one must divide the space of solutions of 
(|5.3|) by the gauge group G, and set the D terms to zero. Those two steps combined are 
equivalent to dividing by the complexification Gc of G. 
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The conclusion then is that the classical vacua are in one to one correspondence with 
complex conjugacy classes of homomorphisms of the SU(2) Lie algebra to that of G. For 
instance, for G = SU(m), there is a single irreducible representation - since SU(2) has 
up to conjugacy a single irreducible m dimensional representation - and various reducible 
representations. The irreducible embedding breaks the gauge symmetry completely, while 
the reducible embeddings leave various subgroups of G unbroken. At the opposite extreme 
from the irreducible solution of (|5.3| ) is the trivial embedding, with T = U = V = 0. Here 
the unbroken symmetry group is G itself. 

In what follows, we will mainly consider G to be SU(2) or S'0(3), in which case the 
only embeddings are the irreducible one and the trivial one. For other groups, there are 
other, intermediate cases. For G = SU(N) with prime N, a simplification arises: the two 
extreme cases are the only ones in which the unbroken symmetry group is semi-simple; 
the vacua coming from reducible but non-trivial embeddings all have U(l) factors in the 
unbroken symmetry group. The partition function of the twisted N = 4 theory vanishes 
for gauge group U(l) on any four-manifold, because of fermion zero modes (which cannot 
be lifted as the U(l) theory is free). This presumably means that in the generalization 
of the computation that follows to SU(N) with prime N, the intermediate vacua do not 
contribute. That would not be so for other groups. For instance, if the gauge group is 
G = SU(nm) with n, m > 1, one can consider an embedding of SU(2) in G such that 
the nm dimensional representation of G decomposes as n copies of the m dimensional 
representation of SU(2); for this embedding, the unbroken gauge group is SU(n), and 
hence one would expect the corresponding vacuum to contribute to the SU (nm) theory. 

Quantum Vacua 

Now we consider the vacuum structure at the quantum level. The trivial embedding 
gives at low energies the pure N = 1 supersymmetric gauge theory; the supermultiplets 
T, U, and V are all massive. The pure N = 1 super Yang-Mills theory is asymptotically free 
and is believed to generate a mass gap (and to undergo confinement) at low energies. For 
gauge group SU(m), the model has a Z2 m global chiral symmetry,!! which is believed to 

34 This is explicitly broken by the coupling to the massive superfields T, U, and V, but the 
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be spontaneously broken down to the Z2 subgroup that permits fermion masses. This gives 
an m-fold vacuum degeneracy, which is believed to account for the full vacuum degeneracy 
of the theory. 

The chiral symmetry and chiral symmetry breaking have the following interpretation. 
The gluino field A of the N = 1 theory has 2m zero modes in an instanton field; the chiral 
symmetry is therefore A — > e 7ri / m A. In an instanton field, one finds an expectation value 
for the operator (AA) m : 

((AA) m ) = A 3m e ie . (5.4) 

Here A is the mass scale of the theory and 9 is the 9 angle. The 9 dependence of ( |5.4|) , 
which is seen most naively from the fact that the left hand side is non-zero at the one 
instanton level, follows in a standard fashion from the quantum numbers of A under an 
anomalous U(l) symmetry of the low energy N = 1 theory. Quantum mechanically, AA 
has a vacuum expectation value; by cluster decomposition this must be an m th root of 
(U): 

(AA) r = A 3 e l ^ m e 2 " r / m . (5.5) 

Here r = 1 . . . m labels the choice of vacuum state and ( ) r is the expectation value in the 
r th vacuum. Thus, (|5.5|) shows that the correlation functions in a particular vacuum are not 
invariant under 9 — > 9 + 2-k; rather, that transformation cyclically permutes the m vacuum 
states. In a particular vacuum, the correlators are invariant only under 9 — ► 9 + 2irm. In 
the context of strong- weak duality, with modular functions of r = ^ + -pr, this means 
that the contributions of individual vacua are invariant only under 

r — > r + m. (5-6) 

Under r — > r + 1, the m vacua are cyclically permuted. 

That completes our discussion of the trivial embedding. The irreducible vacuum is 
much easier to analyze since the gauge symmetry group is completely broken at the classical 
level, so there are no strong quantum effects of any kind (if the gauge coupling is small) 
and the trivial embedding leads to a unique quantum vacuum. The intermediate vacua 

breaking is irrelevant at low energies and does not affect the following remarks. 
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can be analyzed as a combination of the above cases, with the extra phenomenon that 
because of the U(l) factors they have no mass gap. 

Thus, for instance, for G — SU(2) there are three vacuum states, two from the trivial 
embedding and one from the irreducible embedding. They all have mass gaps. The first 
two are related by a broken symmetry. If indeed w is the generator of the Z4 chiral 
symmetry, then w acts by 

w : r — > t + 1 (5-7) 

and exchanges the two vacua that come from the trivial embedding. They are each invari- 
ant under w 2 , which acts by r — > r + 2 and generates the unbroken Z2. 

5.2. Partition Functions Of Some Simple Theories 

Now we need to practice with certain general points about quantum field theory. 

If the gauge group is G = SU(2), the irreducible vacuum does not quite have com- 
pletely broken gauge symmetry; SU(2) is broken down to its center Z2. On a four-manifold 
X , the theory in the irreducible vacuum reduces at low energies to Z2 gauge theory. By 
Z2 gauge theory, we mean a theory in which one sums over Z2-valued flat connections - 
there are 2 bl of themll - and in which dividing by the volume of the gauge group means 
dividing by the number of elements in Z2, which is 2. The partition function of Z2 gauge 
theory on X is therefore 

Z = 2 bl ~ 1 . (5.8) 



(Of course, this formula has a similar origin to the prefactor in ( 3.17]) .) For a general finite 



abelian gauge group T, the result would be (#T) bl 1 , with $T the number of elements in 

r. 

Now let us consider another simple case, the partition function of a theory that has 
a mass gap and in which all degrees of freedom can be measured locally (there are no 
unbroken gauge invariances) . The mass gap ensures that a simple result will emerge if 



35 b\ is the first Betti number of X. We assume for simplicity that there is no torsion (or at 
least no 2-torsion) in the cohomology of X, so that the ordinary Betti numbers coincide with the 
Z2-valued ones. 
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one scales up the metric by g — > tg and takes t large. The contribution to the partition 
function will behave for large t like exp(— L e ff), where - because of the mass gap - L e ff 
has an expansion in local operators: 

L eff = / d 4 x^g(u + vR + wR 2 + ...) . (5.9) 

Now suppose in addition one knows for some reason that the partition function Z = 
exp(—L e ff) is a topological invariant. Topological invariance means that the operators 
that arise in (|5.9|) integrate to give topological invariants. The only topological invariants 
of a four-manifold that can be written as such integrals of a local operator are the Euler 
characteristic x an d the signature a. The contribution to the partition function of a 
vacuum with a mass gap and in which all degrees of freedom can be measured locally is 
thus of the form 

e ax+ba (5.10) 

where a and b are independent of the particular four-manifold. 

For a vacuum with a mass gap that also has an abelian group V of local gauge invari- 
ances (T is a finite group or there would not be a mass gap), the partition function is the 
product of (|5.10j ) with the partition function of the finite gauge theory: 

Z = (#r) bl - 1 e ax+ba . (5.11) 

5.3. Partition Function On A Hyper-Kahler Manifold 

Now we will begin an analysis that will lead to a proposal for the partition function 
of the SU(2) or 50(3) theory on a compact Kahler manifold X with H 2,0 (X) ^ 0. As 
in ||58|| , we first neglect the difference between the physical and topological theories. This 
means that we will obtain formulas that are really valid only for hyper-Kahler manifolds 
(of which there are very few examples) since in the hyper-Kahler case the physical and 
topological theories coincide. Then we make a correction involving the twisting and the 
canonical divisor of X to understand the general case. 

Because the various vacua of the twisted theory all have mass gaps, we can use ( |5.11| ) 
with r trivial for the trivial embeddings and r = Z2 for the irreducible embedding. For 
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hyper-Kahler manifolds, the parameters o and b in (|5.11| ) cannot both be detected. The 
reason is that on a four-dimensional Kahler manifold, the canonical divisor K obeys 



K ■ K = 2 X + 3a. (5.12) 

(Here K ■ K is the intersection pairing.) In particular 2% + 3a vanishes for hyper-Kahler 
manifolds, which have K = 0. Therefore, as long as we are on hyper-Kahler manifolds, 
we can only see one linear combination of x an d a (the other will appear when we make 
corrections involving the canonical divisor). It turns out that particularly nice answers 
emerge if we choose the combination 

v=^-. (5.13) 

which is always an integer for Kahler manifolds. As long as the twisting can be ignored, 
the contribution to the partition function from a vacuum state with a mass gap should 
be e av with a universal a (times a factor involving the discrete gauge symmetry). In ||58|| , 
as there was no dimensionless coupling constant, a was simply a constant. In our present 
problem, there is a dimensionless coupling constant r and a is a function of r. 

For the SU(2) theory, there are three vacuum states, all with mass gaps, so the 
partition function ignoring the twisting would therefore be 

Z = 2 b ^- 1 e a{T)u + e b{T)v + e c{T)u , (5.14) 

where the first term is the contribution from the irreducible embedding (with its unbroken 
Z2 gauge invariance) and the last two terms are the contributions from the two vacua that 
come from reducible embeddings. Further, the contribution of the irreducible vacuum is 
periodic in 9, while the other two are related by a broken symmetry, so 

e o(r+i) _ e a(r) 

e 6(r+l) = r l e c(r) ( 5 _ 15 ) 
e c(r+l) _ ^-l e 6(r) 



Moreover a and b are such that the sum in (|5.14|) transforms as a modular form for an 



appropriate subgroup of SX(2, Z) as explained in §3. Only the factors of i 1 on the right 
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hand side of ( |5.15| ) may need special explanation here. The reason for these factors is 
that, although the bulk theory is invariant under the Z4 symmetry generated by w, on a 
four-manifold X the path integral measure transforms with a factor of i u ; this can be seen 
by counting A zero modes and is explained in the derivation of equation (2.46) of []58| (in 
comparing to that equation note that A = v modulo 4). 

Comparing to the answer found for K3 for SU(2) gauge group in §4.1, we see that it 
is naturally written in precisely this form, with 

e "M =-G(q 2 ^ 2 = 



e 



2 " ' 2*11^(1-5 



2nU2 



'2^ ) 2gV4J]- (!_gn/2)12- ( 5 - 16 ) 



e c(r) =1^(^1/2)1/2 1 



2g 1/4 n™i(i-(-i)T/ 2 ) 12 

Since v = 2 for K3, the signs on the right hand side of ( |5.16| ) are undetermined, and have 
been selected for later convenience.il Of course, this identification of a, 6, and c is not 
really rigorous, but it is so natural that we will accept it. Notice that - since e a , e b , and 
e c cannot have zeroes - the fact that the rj function has no zeroes in the upper half plane 
is essential for the formula to make sense. Also, the factors of i on the right hand side 
of (|5.15|) come from the leading powers g -1 / 4 in e b and e c (e a is instead strictly invariant 



under r t + 1 as the leading power of q is integral) . 
For SU (m) the analog of (|5.14|) would be 



Z = m hl - x e a{r)v + e hl{T)v + e h ^ r)v + ... + e hm{r)v ', (5.17) 

where the first term comes from the irreducible embedding and the other m terms from 
the trivial embedding. In addition to modular invariance of the whole sum, the individual 
terms should obey 

e a(r+l) _ e a(r) 

(5.18) 



e & r (T+l) _ e -iir / m e b r+1 (r) 



36 It would be possible to multiply this formula and all subsequent ones that involve a, b, and 
c by a factor of (— 1)", which we are unable to determine since the available computations are for 
manifolds with even v. 



81 



The proposal for the SU(m) partition function on K3 that was written at the end of §4.1 
is naturally written in this form with obvious choices of a and the b r . (|5.17|) should be 
valid for m prime; otherwise, intermediate vacua contribute, as explained above. 



5.4- Effects Of Twisting 

Now we want to consider what happens on more general Kahler manifolds with a 
non-trivial canonical divisor. The physical and twisted models no longer coincide, and the 
twisting has the following effect. Of the three superfields T, U, and V, two, say T and U, 
are scalars in the twisted model and the third, V, is a (2, 0)-form. (This is already clear 
from the description of the quantum numbers in §2.4.) Moreover, the superpotential W 
now transforms as a (2, 0) form on X (as in equation (2.42) of ||58|| ). 

Given the quantum numbers, a mass term combining V and one of the other super- 
fields, say U, can be written without any difficulty: W\ = —mTrUV. However, to give 
a mass term to all three superfields, one needs as in ||58|| to pick a c-number holomorphic 
two-form uj on X. (So, in particular, H 2,0 (X) must be non-zero.) For instance, once such 
an u is picked, one can give T a spatially dependent mass term 

W 2 = -wTrT 2 . (5.19) 

We now consider the effects of this perturbation. Away from the zeroes of u, the 
superpotential W\ + W2 can be analyzed just as we have done above, leading to the 
familiar classification of vacuum states in terms of SU(2) embeddings. Near zeroes of u, 
the picture changes. In analyzing this, we will consider only gauge group 577(2) or SO(3), 
so in the bulk theory we have only the trivial and irreducible embeddings to consider. 
Both exhibit new behavior near zeroes of u, but in very different ways. In working out the 
details, we assume, for simplicity (as in ||58|1 ), that the zeroes of u are a union of disjoint, 
smooth complex curves C{, i = 1 . . . n of genus <?j, on which uj has a simple zero. 

The Trivial Embedding 

First we consider the strongly coupled vacua that come from the trivial embed- 
ding. The Ci behave like the world-sheets of "superconducting cosmic strings" that can 
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trap charges (see §3.3 of ||5S|1 ). Even after understanding in bulk the dynamics of the 
strongly coupled four-dimensional theory, one must analyze the behavior of the effective 
two-dimensional theory near Cj. 

The main points that one can make come from the symmetry structure (see §2.7 of 
||58H ). In bulk, the theory has a mass gap with Z4 spontaneously broken down to Z2. 
This Z2 is realized near the Cj as a chiral symmetry that prevents fermion masses in the 
effective two-dimensional theory, so it is natural that it should be spontaneously broken, 
giving a two-fold vacuum degeneracy. It is natural to assume that this strongly coupled 
vacuum state has a mass gap and no vacuum degeneracy except what follows from the 
symmetry breaking. 

The vacuum structure (for states coming from the trivial embedding of SU(2)) is 
thus as follows. In bulk there are two vacuum states, say |+) and |— ), related by the 
Z4 — ► Z2 symmetry breaking. Near any of the Cj there is a further two-fold bifurcation of 
the vacuum; |+), for instance, splits into | + +) and | H — ). 

According to ( 5.7|) , the generator w of the underlying Z4 symmetry acts on the con- 
tributions of these vacua to the partition function by r — > r + 1. The Z2 that is unbroken 
in bulk is generated by w 2 , which acts by r — > r + 2. Hence the | + +) and | H — ) vacua 
are exchanged by r — > r + 2. This also exchanges | — h) with | ). 

Now, consider a particular vacuum, say | + +), along a cosmic string component Cj. 
Its contribution to the partition function is again of the form exp(— L e ff), where L e ff 
has an expansion in local operators as in ( |5.9| ). The only difference is that now they 
are two-dimensional local operators since we are determining the contribution localized 
near Cj were the bulk description fails. The only topological invariant of Cj that can 
be written as the integral of a local operator (indeed, its only topological invariant) is 
its Euler characteristic x(C) = 2 — 2gi. Hence the contribution of | + +) near Cj is a 
factor of e*- 1-9 ^^ 1 "), where the function u(r) is independent of the details of X and Cj. 
Similarly, the | H — ) vacuum contributes a factor e^ 1 ~ 9i ' v ^ T \ where u and v are exchanged 
by r -> r + 2: 

u(t + 2) = v{t) 

(5.20) 

u(t + 4) = u(t). 
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With all its possible bifurcations, which must be chosen independently on each Cj, the |+) 
vacuum contributes a factor of 



]J [e u{T){l ~ 9i) + Ue v{T){1 - 9i) ^j . (5.21) 



i=l 

The factors of ti = ±1 have the same origin as the factors of z _1 on the right hand side of 
( |5.15|) - they incorporate a global anomaly (in the coupling to gravity) in the Z2 symmetry 
that permutes | + +) and \-\ — ). Similarly the possible bifurcations of the |— ) vacuum 
contribute the transform of this under r — > r + 1, or 



r [[ L«(T+l)(l-g t ) + t , e v(r+m- 9i )\ (522) 



i=l 

Now, ( |5.12|) is equivalent to the formula 



2X + 3a = ^(^-1). (5.23) 

i 

This formula is the real reason that we need not in the bulk theory consider an extra factor 
of the form exp(7(r)(2% + 3cr)). It could be absorbed in redefining the function u. 

The Non-trivial Embeddings 

Now we analyze what becomes of the non-trivial embeddings when the mass terms 
involve a choice of holomorphic two-form. What this means is that we consider the theory 
with the combined superpotential 

Wtot = -wTrT 2 -mTrUV + Tr T[U,V], (5.24) 

and look for a classical solution with zero action in the twisted theory. The analysis is 
feasible because by arguments similar to those in §2.4, the (2, 0) part of the curvature can 
be assumed to vanish, and the structure is therefore holomorphic. 
The condition for a critical point of Wtot is 

[T, U] = mU 

[T, V] = -mV (5.25) 
[U, V] = 2uT. 



37 This factor appears in equation (2.66) of [ |58f and is explained in the derivation of that 
equation. 
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These equations have the following immediate consequence. Regardless of what ui does, 
the matrix T is at each point in X conjugate to 



m (1 

2 -1 



(5.26) 



and in particular never vanishes. Thus T breaks the gauge group down to U(l). 

For the kinetic energy of T, U, and V to vanish in the twisted theory does not require 
that they should be covariantly constant, but only that they should be holomorphic. The 
existence of a holomorphic T that is everywhere in the conjugacy class indicated in ( |5.26j ) 
means that the SU(2) gauge bundle E splits as a sum E = L©L _1 with some holomorphic 
line bundle L. With such a splitting, T then globally takes the form of ( |5.26| ). ( |5.25| ) then 
implies that U and V are of the form 



U 
V 

with 



b 
. 

(5.27) 

x 
c 



be = mu). (5.28) 

Here b is a holomorphic section of the line bundle L 2 , and c is a holomorphic section of 
the line bundle K ® L~ 2 . (K enters because V is a (2, 0)-form.) 

In order to make the exposition in the rest of this section as simple as possible, we will 
assume first that the zeroes of ui consist of a single smooth connected curve C, on which 
io has a simple zero. There is then only a single U in (|5.21| ) and (|5.22 ), and according to 
||58|| , equation (2.61), it is 

t=(-iy. (5.29) 

At the very end of the section we will return to the case that the canonical divisor is a 
union of disjoint smooth curves Cj. 

With the given assumption about u, ( |5.28| ) is easy to analyze. The product be vanishes 
with multiplicity one on the irreducible smooth curve C, so either c vanishes on C and 
b has no zeroes, or vice-versa. If b has no zeroes, the line bundle L is trivial, and up to 
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a complex gauge transformation, b = m and c = u. If c has no zeroes, then K © L~ 2 is 
trivial, so L = K 1 / 2 (that is, L is a line bundle with L® 2 = K) and up to a complex gauge 
transformation, b = uj and c = m. 

Once the holomorphic data are known the metric on the gauge bundle is determined 
(presumably uniquely, by a convexity argument, but certainly not explicitly) by the con- 
dition (analogous to equation ( 2.67|) in §2.4) for the (1, 1) part of the curvature. 



Thus, we have determined the possible vacuum solutions: there is one on the trivial 
bundle Eq, which has instanton number zero, and one on the bundle E x = K 1 / 2 © K~V 2 , 
whose instanton number is 

K-K g-l 

— — = — t> ( 5 - 3 °) 

with g the genus of C. Note that this number is typically negative. Actually, we saw in 
§2.4 that bundles of negative instanton number can contribute when the vanishing theorem 
fails, and we saw (under a hypothesis that was equivalent to having the canonical divisor 
connected and K ■ K > 0) that the most negative possible value of a bundle that would 
contribute was —K-K/4. 

Of course, it might happen that X is not a spin manifold. Then K 1 ! 2 does not 
exist. In that case, only Eq will contribute to the SU(2) theory. However, E\ will always 
contribute in the SO(3) theory. Indeed, the SO(3) bundle ad(i?i) derived from E±, which 
is K © K~ x © O (with O a trivial bundle) always exists. Its second Stieffel- Whitney class 
is ^2 ( ad ( £i)) = W2(X), where W2(X) is the second Stieffel- Whitney class of the tangent 
bundle of X, which is the same as the reduction modulo two of C\(K). Indeed, ad(-Ei) is 
the bundle of self-dual two- forms, while E\, if it exists, is one of the two chiral spin bundles 
of X; the obstruction to constructing the latter from the former is W2(X). 

The General Structure 

The general structure of the partition function is then as follows, for the SU(2) theory: 

Zsu&) =2 bl "V^ (F + G6 W2(X)=0 ) 

+ e h{r)v ^ 1 ~ 9)u( - T) + (-l)^e (1 - ff)u(r+2) ) (5.31) 
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(The factors of (-1)" come from the U in (|5T2T|) and ( jTgg) via (|5T29|) .) F and G are 



corrections near the canonical divisor to contributions of the vacuum bundles Eq and E%] 
of course £1 only contributes to the SU(2) theory if ui2(X) = 0. In what follows we 
abbreviate xq = W2(X). The functions a, 6, c, and u are universal (independent of X) 
because of the mass gaps. It might be possible to argue a priori for a similar universality 
of F and G,il but in any case S'-duality implies such universality. 

Now given ( |5.31| ), we want to determine the analogous formula for the 50(3) theory. 
In the SO (3) theory, one must sum over all values of W2(E). One can nearly interpret ( |5.31| ) 



as the contribution of bundles with W2(E) = to the SO (3) theory, but it is necessary 
to make a correction that involves comparing the volumes of the SU(2) and SO (3) gauge 
groups, as discussed in §3. So to get from ( |5.31| ) the contribution to the SO (3) theory 
from bundles with W2(E) = 0, one simply divides by 2 bl ~ 1 . 

Once this is known, there is no problem getting the contribution in the SO (3) theory 
from bundles with an arbitrary value of x = 102(E) . The F and G functions anyway 
only contribute for a particular value of x, and the others because of the mass gap only 
see the global object x through its influence on anomalies which determine the relative 
phases between contributions of different vacua. The x-dependence of these anomalies is 
explained in [58| in the derivation of equation (2.79). Putting together the anomalies and 
the factor of 2 _&1+1 , the contribution in the SO (3) theory of bundles with u>2(E) = x is 

Z x =e< r > {FS X=0 + GS X=X0 ) 

+ i~ x2 2 l ~ bl e c ^ v ^ e ( 1 -9) u ( T + 1 ) _|_ ^_iy+x-x e (i- g )u(T+3) \ ^ 

Of course, the F function, associated with the vacuum bundle Eq, only contributes for 
x = 0, and the G function, associated with the vacuum bundle E\, only contributes if 
x = Xq = w 2 (X). 

The unknown functions can be determined as follows. According to the blowing-up 
formula of Yoshioka, blowing up a point in a Kahler manifold Xq multiplies the partition 



38 It is not clear whether for the irreducible vacua, the weakly coupled theories that arise near 
the canonical divisor have mass gaps. 
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function by a factor of Oq/t] with r\ the Dedekind eta function and 

6 = J2 ( l n2 - ( 5 - 33 ) 
nez 

(In this assertion, x is taken to be a pullback from Xq.) If we consider the special case 
that Xq is K3 (so the partition function on its one-point blow-up X is governed by ( 5.32j ) 
with g = 0, v = 2, x ■ xq = 0)JH this implies that 

F(t) = e u ^ + e u ^ = e^+V + e u ^ = (5.34) 

The function 9o/r] 2 transforms in a two-dimensional representation of SL{2, Z), the other 
function that enters being 9i/rj 2 with 

9i= ^- ( 5 - 35 ) 

Under r — >• — 1/r, F and G are mapped to e u ^ and e u ^ T+2 \ respectively, as will be 
clear presently when we check the modular transformation laws in detail. Since F is known 
(at least in this special case) e u ^ is thereby determined - once and for all, since u is a 
universal function. By applying r — > one now determines F and G in general. In 

particular, one finds that G ~ (^i/^ 2 ) 1-3 ~ q^~ 9 ^ 4 (l + . . .)rf where the leading exponent 
^(i-s)/ 4 j s j n happy agreement with the instanton number of the classical solution that is 
responsible for the presence in the formula of the G function. 

In the following subsection, we will write down a precise formula that was found by 
the reasoning just sketched and verify that it has all of the right properties. The reader 
who works through the verification should be able to see that given the general structure 
that we have proposed, the formula we write down is the only one that works. 

5.5. The Formula 

We continue to assume temporarily that the canonical divisor of X is connected; its 
genus is 

g - 1 = 2 X + Sa. (5.36) 



39 Recall from the end of §2.4 that the vanishing theorem holds for the blowup of K3 - so we 
can apply instanton results such as those of Yoshioka. Similarly, we can consider K3 with an 
arbitrary number of points blown up. 
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b 2 + and b 2 will denote the dimensions of the spaces H 2 > + and H 2 ' of self-dual and 
anti-self-dual harmonic two- forms on X; then we have 

b 2 = b 2 + + b 2 ~ , 

a = b 2 +-b 2 ~ (5.37) 
X = 2-2b! + b 2 + + b 2 ~ 7 

with b 2 the second Betti number of X. 
The formula we propose is then 

Z.= gctf))" 72 (*.,o(-l)" (^) 1_9 + ^o (^)^ 

+ 2^ ( W^Y 72 ( ( e JL±li) 1 ~ 9 + { _ ir +*-x f^-e^ 



4 v " 7 \ V 2r] 2 J v ' V V 



'(5.38) 



This formula manifestly transforms correctly under r — > r + 1. According to the 
precise version of the strong-weak duality conjecture proposed in §3.3, the transformation 
law under r — > — 1/r should be 



Z tf (-l/r) = 2-^/ 2 (-l)^(^)-^ 2 ^(-l)-»Z x (r). (5.39) 



To evaluate the sum over x on the right hand side, we need various formulas: 

£(-ir^, = i 

X 

E(-ir^ xo = (-i)v* 

X 

j2(-ir y = 2 b2 Ko 

X 

J2(-l) X - y + X ' X0 = Z^dy^ 
X 

j2(-ir y i~ x2 = 2 b ^ 2 i- a ' 2+y2 

X 

x 
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(5.40) 



>£Co 



The only formulas here that require comment are the last two. The next-to-last sum in 
( |5.40|) , on completing the square, is Y^ x i~( x+ y^ +y = %y ^ x i~ x , and so is equivalent to 

2 

the sum ^2 x i~ x > which is a special case of a sum that we met in §3.3. The last sum in 
( |5.40| ) can be treated the same way; alternatively, using the Wu formula, 

(-1) X - X0 = (-l) x \ (5.41) 

the last sum is the complex conjugate of the one before. 

Using these sums, we find that the right hand side of ( |5.39|) is (dropping the modular 
weight) 

R . H .S. = 2-V. (^!>) ^ + (-1)— (£) ^ 



4 J \ v " \ 2rf ) \ 2ry 

(5.42) 

To verify ( |5.39| ), we need the modular transformations of the various functions. Under 
r — > — 1/r, we have (apart from modular weight) 



e 1 J 72 V 1 -V V*i 



(5.43) 



This means, in particular, that 6q is exchanged with (6q + Q\) / \/2 and 6>i with {6q — #i)/\/2, 
while (6>o±i#i) is mapped to i ±1 / 2 (6'o =F^i). These facts are convenient, since the functions 
just mentioned all appear raised to the (1 — g) power in ( |5.38|) . We also have (apart from 
the modular weight) 

G{q l / 2 yl 2 - 2-^ + ^G(q 2 y/ 2 (5.44) 

G{ _ q l/2y/2 _^ G( _ ? l/2 r /2_ 

With the aid of these transformation laws and identities ( |5.37|) and (|5.41|) , it is straight- 
forward to verify (|5.39| ). 
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Disconnected Canonical Divisor 

It remains only to generalize the above formulas for the case that u vanishes (with 
multiplicity one) on a union of disjoint smooth components Cj, % = 1 . . .n, of genus gi. 

This has two effects: the bulk vacua |+) and |— ) that come from the trivial embedding 
of SU(2) bifurcate along the Q into 2 n separate ground states - in a fashion that we 
have already described in ( |5.22| ) and ( |5.21| ), before we specialized to n = 1. Also, the 



contribution of the irreducible embedding is more complicated. 

It is straightforward to write down the contribution of the trivial embedding since we 
now have determined all of the functions that appear in ( p. 22 ) and (|5.21 ). The contribution 
is 




2^ J + ^ l > \ 2r?2 



(5.45) 



The ti obey 



n*i = (-ir> (5-46) 



as shown in equation (3.57) of ||58|1 . Also, rii( — l) x Ci = (—l) x ' x ° as the union of the Cj is 
the canonical divisor, whose Poincare dual reduces modulo two to Xq. 

To find the contributions of the irreducible embedding, we must reexamine the fac- 



torization be = mu of equation ( [5.28 ). The possible solutions are as follows. Given that 



uj vanishes with multiplicity one on the union of the Cj, b can vanish with multiplicity 
one on any subset of the C{, with c vanishing on the others. If then Lj = 0{Ci) is the 
line bundle whose sections are functions with a simple zero on Cj, the SU{2) bundle E is 
= L// 2 © L e - _1/2 , where 

L ? =®? =1 VS (5.47) 

where = Oor 1 are chosen independently. There are thus 2 n solutions in all (a fact which 
under SL(2, Z) transforms into the fact that the |+) or |— ) vacuum in bulk bifurcates into 
2 n choices along the cosmic strings). Of course, as may not have a square root, the 
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bundles may not really exist as SU(2) bundles, but the corresponding SO (3) bundles 
ad(EV) always exist - with second Stieffel- Whitney class 

«j 2 (e) = X) < * ( 5 - 48 ) 

i 

Here [C»] is the reduction modulo two of the Poincare dual of Of, one has W2(X) = J^JCj]. 
The instanton number of the bundle E? is 

-£^. < 5 - 49 > 

i 

The contribution of all 2 n solutions of be = mui is 




(5.50) 



This formula was found by requiring that it is zero except if x is equal to u>2(e) fo r some 
e and that the sum of ( |5.50|) and (|5.45| ) transforms correctly under r — ► — 1/r. That last 
assertion can indeed be verified using the identities that have already been exploited above. 
Note also that the smallest power of q in the contribution of a given solution agrees with 
the instanton number in ( |5.49|) . 

6. Connections with RCFTs and Strings 

In this paper, especially in §4, formulas familiar in the context of two-dimensional 
rational conformal field theories (RCFTs) and strings made unexpected appearances. In 
this section we discuss some aspects of this mysterious phenomenon. We will first consider 
relations to RCFTs and then speculate about the potential applications to the question of 
5-duality in string theory. 

We have seen in §4 that blowing up of a point has the effect of multiplying the 
N = 4 partition function with 577(2) gauge group by essentially the level one characters 
of the 577(2) WZW model. The magnetic flux vector labels the conformal blocks, and the 
instanton number mod 1 in the gauge theory corresponds to the conformal dimension mod 
1 in the RCFT. Moreover we saw that the level k characters of SU(n) arise for N = 4 
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super Yang-Mills theory with gauge group U(k) on ALE spaces A n —\. One might think 
that the appearance of conformal field theory characters simply reflects the fact that those 
objects are modular and that there are not too many modular objects of low weight and 
level. However, the connections go beyond the partition function. For the ALE spaces, for 
instance, Nakajima does not just get a WZW character but finds an action of the affine 
Kac-Moody algebra on the Hilbert space consisting of the cohomology of instanton moduli 
spaces. 

Generalizing the structure found by Nakajima, one might wonder, for instance, 
whether for a more general four-manifold there is a natural action of the Virasoro al- 
gebra on the cohomology of instanton moduli spaces if one consider all instanton numbers 
at once. And what is the analog of the two-dimensional operator product expansion? Can 
one find for each rational conformal theory a (possibly non-compact) four-manifold whose 
N = 4 twisted partition function gives the characters of that RCFT? 

In two-dimensional RCFT's, the conformal blocks Xi form a representation of 
SL(2, Z), much in the same way as do our partition functions Z v for v G H 2 (M,Z n ). 
RCFT's have additional structure: From the underlying operator-product relations one 
deduces a commutative, associative multiplication law called the Verlinde algebra on the 
space of blocks |^9[. Does it have an analog in our problem? Since the Verlinde algebra 
is determined by the S matrix, and since the four-dimensional problem has an S ma- 
trix given in ( |3.15|) , we can deduce that the analog of the Verlinde-algebra should be the 
multiplication law 

" Z^j 'Zl Vl -\-y 2 . 

This operation is indeed commutative and associative (it is associated with the ordinary 
addition law of the Z n -valued flux) further enhancing the analogy between four dimensional 
gauge theories and rational conformal field theory. 

Stringy Spectrum? 

Before going on, let us recall some aspects of the tests of S'-duality in string theory || . 
One basic test is to identify all the BPS-saturated states with given "electric" charges and 
find what "magnetic states" they lead to under SL(2, Z) transformation. One can then 
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try to see if these states exist. This is usually done for the field theoretic modes. However, 
in string theory there is actually a full stringy spectrum of massive BPS-saturated states. 
For heterotic strings the number of such states is given by the (left-moving) bosonic string 



oscillator partition function |6(J . To see this one notes that if one takes the supersymmetric 
oscillators to be in the ground state, then regardless of the lattice momenta, one gets a 
"small" supersymmetry representation corresponding to BPS-saturated states 0. Thus 
all the states which come from Nr = and arbitrary Nl saturate the BPS bound and, if 
S'-duality is to hold, there must be corresponding magnetic states. For Nl = 0, these states 
are the BPS monopoles that can be seen in field theory. For Nl = 1 it was suggested in 
that they have the right quantum numbers to be identifiable with so-called iif-monopoles 
[|6l]J)2|. Based on the N = 4 structure, and the fact that if-monopoles saturate the BPS 
bound, one expects the moduli space of if-monopoles to have a hyper-Kahler structure. 
Moreover the "magnetic" states would correspond to the cohomology of this moduli space. 
Unfortunately this moduli space is little understood. However, K3 was proposed as a 



candidate in [|36[], in part because it is four dimensional and hyper-Kahler and has 24 



dimensional cohomology. The fact that K3 has 24 dimensional cohomology matches the 
fact that the bosonic string at Nl = 1 has 24 physical states (namely a^-jO), where i runs 
over the transverse directions in the light-cone gauge). Moreover, the Lorentz spin of these 
24 states agrees with that of the cohomology of K3 if we identify the helicity operator in 
space-time with (Fl + Fr)/2, where we define an element of the cohomology group H p,q 
to have Fl = p — 1 and Fr = q — 1. Indeed, there are 22 states with helicity 0, one with 
helicity +1 and one with helicity —1.0. 

Is there an analog of this for Nl > 1? The discussion in §4 makes clear a possible 
analog: the string states at arbitrary Nl can be associated with the cohomology of the 
symmetric product of Nl copies of K3\ The helicities work out correctly, just as they 
do for K3. Note that the main thing needed for K3 to give the bosonic string partition 



40 If we compactify to six dimensions, there are two light cone helicity operators; identifying 
them with (Fl + Fr)/2 and (Fl — Fr)/2, the massless states of the bosonic string compactified 
to six dimensions agree with the cohomology of K3. This comment also applies to the type II 
superstring considered above. 
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function is its Euler characteristic (and the hyper-Kahler structure which enables one to 
use the orbifold formula to compute the cohomology of the symmetric product). So it is 
conceivable that K3 could be replaced here by another, perhaps non-compact, space. 

If one compactifies a Type II superstring on a six-torus, there is again a stringy 
spectrum of BPS-saturated states, this time in 1-1 parallel with fermionic oscillator states. 
Is there any way to generate the oscillator states of a fermionic string by taking the 
cohomology of a symmetric product? The N = 4 supersymmetry again suggests that the 
relevant moduli space would be a hyper-Kahler maniflold. In dimension four, other than 
K3 there is a unique compact example, namely the four-torus T 4 . The cohomology of T 4 
is 16 dimensional, with 8 "bosonic" states (of even degree 0, 2, or 4) and 8 "fermionic" ones 
(of odd degree 1 or 3). This agrees with the fact that the fermionic string has 8 transverse 
Bose oscillators and 8 Fermi ones. If we take the helicity operator to be (Fl + Fr)/2, 
then the bosonic states in the cohomology of T 4 include six states of helicity zero and 
two of helicity ±1 while the Fermi states include four of helicity 1/2 and four of helicity 
— 1/2. This agrees with the quantum numbers of the transverse oscillators of the string! 
The cohomology of a symmetric product of T 4, s can be analyzed by the same orbifold 
techniques that we used for K3. It is a Fock space derived from "one-particle states" 
which are the cohomology of T 4 , and so agrees with the fermionic string spectrum. 

While we have little insight to offer at the moment, this relation of the oscillator 
spectrum of bosonic and fermionic strings to the cohomology of the two compact hyper- 
Kahler manifolds in four dimensions is certainly provocative. 
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